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ABSTRACT 


We used 3T fMRI and 306-channel MEG to study whether the posterior and anterior sources of the Nlm response differentially adapt to sound 
location (“where”) and identity (“what”) information, respectively. Subjects were presented with pairs of the Finnish vowels /as/ and /o/ simulated 
from locations straight ahead or 45 degrees to the right. Either the phonetic or spatial attribute changed within the sound pair, with the other attribute 
held fixed. The adaptation of responses to the second sound (Test Stimulus) was investigated as a function of physical resemblance to the first 
(Adaptor). Second, we investigated how selective attention modulates this adaptation. Subjects pressed a button upon hearing two consecutive pairs 
identical with respect to the target attribute, which changed in successive blocks. The posterior Nlm source activity was strongly adapted irrespective 
of the phonetic similarity/dissimilarity between Test Stimulus and Adaptor. fMRI/MEG and MEG source-modeling, however, revealed strong 
activation in areas posterior to primary auditory cortex, when Test Stimulus differed spatially from Adaptor. The anterior Nlm activity, in turn, was 
enhanced after phonetic changes. Selective attention enhanced these feature-specific tuning effects. Results suggest that neuron ensembles in 
posterior auditory cortex may be specifically tuned to sound-location cues. Selective attention might modulate the feature tuning in auditory cortex. 
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INTRODUCTION 


Anterior and posterior areas of the primate auditory cortex may be differentially tuned to processing of object/content ('whaf) and sound location 
(‘ where ’) information [Rauschecker et al. 2000], but this effect has not yet been conclusively demonstrated in humans. There is also disagreement on 
whether auditory space is represented topographically or in distributed fashion [Belin et al. 2000]. Recent cat studies, suggesting distributed coding, 
however, seem to go along with the notion that neurons in posterior auditory fields might be specifically suited for spatial processing [Stecker et al. 
2003]. Further, these distributed neuronal units were found to have a roughly topographical organization. Our recent fMRI/MEG studies suggest that 
feature-specific adaptation in different cortical auditory regions might reveal their functional specialization in humans [Jaaskelainen et al. 2004]. 
Adaptation is the phenomenon thought to underlie response suppression of serially-presented sounds, based on local adaptation, and lateral inhibition 
that spreads across the feature-topographic cortical region [May et al. 1999]. The degree of adaptation, as a function of the difference between a test 
stimulus and preceding adaptor, may thus provide an experimental measure of tuning properties in a specific cortical region. Here, we studied 
whether the posterior and anterior sources [Lu et al. 1992, Sams et al. 1993] of the neuromagnetic Nlm response differentially adapt to sound 
location and phonetic category, respectively. The effects of selective attention on response tuning properties were also studied. 

METHODS 

Fig. 1 shows the tasks and stimuli. Seven healthy 
right-handed native Finnish speakers (age 21-44 
years, 2 females) with normal hearing were 
measured with 306-channel MEG (passband 0.01- 
172 Hz at 600 Hz; Elekta Neuromag Ltd., Finland) 
in a magnetically shielded room [Cohen et al. 

2002]. Human subjects’ approval was obtained from 
the local institution. Voluntary consents were signed 
before each measurement. At least 110 artifact-free 
2000 ms (200 ms baseline) epochs, including 
responses to each tone in a pair (Fig. 1), were 
averaged off-line (0.3^40 Hz bandpass) for each 
condition of interest. T1-weighted 3D anatomical 
MRI scans were used for individual boundary 
element models. Whole-head 3-T fMRI (Siemens 
Trio, Erlangen, Germany) was recorded to bias the cortically constrained depth-weighted minimum-norm estimates (MNE) in 5 subjects. A sparse- 
sampling gradient-echo fMRI BOLD sequence was used (TE=30 ms, flip angle 90°, TR=10.2 s; 20 axial 5 mm slices, 0.5 mm gap, 3.125 x 3.125 mm 
in-plane), with the scanner’s coolant pump switched off. Sound pairs were presented 8.35, 5.15, and 1.95 seconds before each (N=2\l) echo-planar 
imaging volume acquisition. The MEG epochs were pooled to three classes according to the relative attribute change in Adaptor vv. Test Stimulus, 
separately for each attention condition. A loose orientation constraint (source covariance matrix entries corresponding to the current component 
normal to the cortex set = 1 and the transverse components = 0.4), with source locations 90% weighted by significant fMRI activations, was utilized 
in the inverse-solution calculations. Equivalent current dipoles (ECD) were fit to the ascending and descending phase of the adaptor Nlm, and 
entered into time-varying multi-ECD models to explain the Test Stimulus responses (same model across all conditions). A three-way ANOVA with 
contrasts (Huynd-Feldt correction) tested the influence of the factors hemisphere, attention, and changing attribute on the Test Stimulus ECDs. 

RESULTS 

Combined fMRI/MEG estimates (Fig. 2) suggested that the regions posterior to the primary auditory cortex might be more strongly activated when 
Test Stimulus is spatially dissimilar to Adaptor. The time-varying multi-ECD models provided robust statistical support for this result (Fig. 3). 


Figure 1. Subjects were presented with pairs of the Finnish vowels /ae/ and 
/o/ (duration 300 ms) simulated from locations straight ahead or 45 degrees 
to the right of the subject’s head. The gap between the first (Adaptor) and 
the second (Test Stimulus ) sound was 250 ms. Stimuli were simulated by 
convolving raw vowel recordings with acoustic impulse responses measured 
at the ears of a manikin head to generate stereo stimuli with appropriate 
phonetic (‘ whaf ) and spatial (‘ where ’) attributes. Either the phonetic or 
spatial attribute changed between Adaptor and Test Stimulus, with the other 
attribute (or occasionally both attributes) held fixed across the pair. During a 
one-back task, subject was instructed to press a button upon hearing two 
consecutive pairs identical with respect to the target attribute (represented on 
dark background, Left and Right). The target attribute, prompted with a 
visual cue, alternated in consecutive blocks. In three 20-min MEG sessions, 
60-sec Attend Location (Left) and Attend Phoneme (Right) blocks were 
interleaved with 30-sec unattended blocks. In the 38-min fMRI sessions, all 
these blocks, and an additional rest condition, were 30 sec in duration. 
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Consistent with previous observations [Sams et al. 1993], the responses to Adaptor were explained by earlier posterior (Left hemisphere, 90 ms, {x, y, 
z}={-50, 4, 50 mm}; Right hemisphere, 85 ms, {x, y, z}={52, 5, 53 mm}) and later anterior (Left hemisphere, 116 ms, {x, y, z}={-47, 12, 51 mm}; 
Right hemisphere 118 ms, {x, y, z}={51, 16, 51 mm}) ECDs. For each subject, the respective individual ECDs were entered into the time-varying 
models to explain responses to Test Stimulus. Robust posterior N1 source activation emerged when Test Stimulus came from a different direction 
than Adaptor (Fig. 3), but this source activity strongly adapted when the phonetic category changed but location was held fixed. This effect was 
significant in both left and right hemispheres, as suggested by a the main effect Stimulus Change in the 3-way ANOVA of the posterior N1 source 
amplitude (F 2 ,6 = 13.4,/><0.01). The anterior N1 source, in turn, was slightly enhanced when Test Stimulus was phonetically dissimilar to Adaptor, as 
suggested by the 3-way ANOVA (F L8 6 = 4.2, p <0.05). The anterior and posterior sources appeared to be specifically modulated by attention. The 
posterior N1 activity for location changes was significantly larger in Attend Location than Attend Phoneme condition in the right hemisphere (4=3.0, 
<0.05). The anterior N1 activity for phoneme changes was, in turn, larger in Attend Phoneme than Attend Location condition in the left hemisphere. 
The ANOVA main effect of attention was significant for the anterior N1 source (F 2 y=6.1,p <0.05). 

DISCUSSION 

Results indicate that posterior 
auditory-cortex activity to spatial 
stimulation is considerably less adapted 
by preceding sounds coming from a 
different direction than the same 
direction. This could be explained by 
sharper feature-tuning at this region, 
resulting in less lateral inhibition and 
larger responses. Previous results 
[Jaaskelainen et al. 2004] have shown 
that regions anterior to the primary 
auditory cortex might, in turn, be more 
specifically tuned to properties like 
sound frequency, which led us to 
hypothesize that the anterior Nlm 
generators might overlap the putative 
auditory “what” pathways. Here, the 
indices of phoneme-specific tuning of 
the anterior Nlm activity were not as 
clear as the results regarding sound 
location and posterior regions. However, 
group statistics suggested slightly 
pronounced activation for phonetic vs. 
directional changes at this region, tentatively supporting the idea of an anterior “what” stream. Interestingly, the tuning properties of neurons 
underlying anterior and posterior Nlm activity appeared to be modulated by selective attention. Statistically, the differences were significant in the 
right hemisphere posterior area for the enhanced tuning of sound location processing and in the left hemisphere anterior areas for sharpening of the 
phone representations. This left-right hemispheric difference in attentional modulation, however, was very slight. 

In conclusion, the present result suggests that neural ensembles occupying regions posterior to the human primary auditory cortex may be 
specifically tuned on sound-location cues, which supports previous animal findings [Rauschecker et al. 2000]. Further, selective attention seemed to 
sharpen the feature tuning of auditory cortex neurons. 
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ABSTRACT 

We found an activity specific to deviant correspondences of audio-visual information arising only when the visual stimuli preceded the auditory 
stimuli by the moderate time differences with the presence of attention directed to the stimuli, which was similar to the mismatch field (MMF). Since 
the relation of the elicitation mechanism between this activity and the MMF still remains to be unclear, magnetoencephalography (MEG) was used to 
examine if the activity involves a memory-trace process. The animation of three keyboards with three kinds of tone bursts adjusted to correspond to 
each other was employed. The left and right sides of the corresponding pairs were designed as the congruous paired (CP) stimuli, and the pairs 
inverting that correspondence as the incongruous paired (IP) stimuli, whose frequencies were adjusted to be equal. Whenever a left or right pair was 
delivered, a congruous middle pair (MP) as the perceptual control stimulus always followed it. Two experiments with the intentional use/nonuse of 
vision-based prediction were conducted. As a result, like the case for MMF, the component over Nlm for the IPs was bilaterally elicited under the 
prediction condition, whereas it was not recognized under the non-prediction condition. Since the patterns of crossmodal information cannot be 
traced due to the equal frequencies of presentations in each correspondence, it is concluded that the vision-based prediction found to create the 
template explicitly, thereby awaiting the incoming sound to be compared. 

KEY WORDS 

Mismatch field (MMF), Magnetoencephalography (MEG), Vision-based prediction 

INTRODUCTION 

To date, we have investigated an activity specific to deviant correspondences of audio-visual information in the oddball paradigm, and found a 
component evoked over Nlm that appeared only when the visual stimuli preceded the auditory stimuli by the moderate time differences with the 
presence of subjects’ attention directed to the stimuli [Aoyama, 2003], which was similar to the mismatch field (MMF) that is automatically elicited 
by deviant stimuli embedded in a sequence of standard background in the auditory modality [Hari, 1984] (electroencephalographic study for 
[Naatanen, 1978]). As the model for MMF, a memory-trace process, in which the template pre-attentively formed by repetitive standard sounds in the 
auditory sensory memory is compared with the deviant novel information that differs in terms of various attributes (e.g. frequency, intensity, and 
duration [Rosburg, 2003]) in order to detect sound changes, is adopted. It was also reported that a mismatch response was evoked in the case of 
complex sounds, suggesting that multiple information is simultaneously traced in this system [Brattico, 2002]. Despite these facts, it still remains to 
be unclear whether the activity elicited by the deviant correspondences is due to the memory-trace process or the other process based on the 
discordances between audio-visual information. Thus, it is assumed that this activity is not elicited for the incongruous pairs with equal frequencies to 
the congruous ones if crossmodal information is simultaneously traced. In this study, magnetoencephalographic (MEG) recording was performed 
during delivery of the audio-visual paired stimuli in order to clarify the elicitation mechanism of the activity by means of investigating the relation 
between this activity and the MMF. 

METHODS 

Eight healthy subjects (21- to 26-year-old males) participated in all of the experiments as volunteers. All subjects had normal visual and auditory 
functions and relative pitch, and one of them had absolute pitch. 

Audio-visual paired stimuli were delivered to these subjects. For the visual stimuli, the animation of three keyboards was adopted, as shown in 
Figure la. As the auditory stimuli, 500-, 1000-, or 1250-Hz tone bursts at a 65-dB sensation level that were adjusted to correspond to the left, middle, 
or right keyboard respectively were computationally produced. Synchronized with the instance of a keyboard reaching the bottom position, the 
auditory stimulus began to be delivered and the trigger signal was generated (all the latencies in this study are based on this moment). Details of a 
stimulus sequence are shown in Figure lb. Five kinds of paired stimuli were presented. The 
left and right sides of the corresponding pairs were designed as the congruous paired (CP: CPI 
and CP2, respectively) stimuli, and the pairs inverting that correspondence as the incongruous 
paired (IP: IP1 and IP2, respectively) stimuli, whose frequencies were adjusted to be equal 
(P = 0.125). Each time a left or right pair was delivered, a congruous middle pair (MP) as the 
perceptual control stimulus always followed it (P = 0.5), although the order of CPs and IPs 
was randomly arranged. An example of presentation order is shown in Figure lc. 

The subjects were told to perform two sorts of tasks: (1) a prediction task in which the 
subject intentionally attempted to predict the pitch of an incoming sound as soon as the left or 
right keyboard began to move, and to perform the matching of audio-visual information, (2) a 
non-prediction task in which the subject attempted to perform the matching after the delivery 
of a sound without any intentional prediction. The order of the performed tasks was 
counterbalanced across subjects. Under each task, the reaction time (RT) measurements and 
MEG recordings were conducted. In the RT measurements, subjects were instructed to press a 
button with their right forefingers as soon as IPs were presented under each condition for the 
purpose of the enhancement of task performance. The audio-visual stimuli were delivered at 
least 640 times for each task without any neurophysiological recording. The RT data were 
analyzed for every 40 IPs. Based on the RT data for the last 40 IPs, only the subjects that 
satisfied the criterion of the accuracy of button presses > 0.95 and the criterion of the 



MP CP2/IP2 MP 



Latency [ms] 

Figure 1 . (a) An illustration of three keyboards 
with a fixation point used for presenting visual 
stimuli, (b) A presentation order of visual 
stimuli, (c) Details of a stimulus sequence. 
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Figure 2. Results of the individual 
and total RTs for IPs in eight 
subjects nS 1-8 □. 


difference between the adjacent averaged RTs < 15 ms were permitted to advance to the MEG 
recordings, and the others continued the RT measurements. A two-way repeated-measures analysis of 
variance (RMANOVA) was performed on the RT data. 

As for the MEG recording, data were recorded with a 440-channel whole-cortex MEG system 
(PQ2440R, Yokogawa Electric Corp., Japan). The sampling rate was 625 Hz and the analogue recording 
passband was 0.03-200 Hz. Every experiment had 640 presentations that consisted of 4 blocks, and 
there was a break of 30 s between each block. After the rejection of artifact-contaminated epochs, the 
collected data were averaged over the period of -500-550 ms. The offset was removed based on the 
interval of 200 ms before the onsets of visual stimuli, and the averaged data were digitally low-pass 
filtered at 40 Hz. In order to cancel modality-specific activities, the differential waveforms were 
obtained by subtracting the averaged magnetic fields for IPs from those for CPs for each channel i: 

B m = fg /P i(0 + BlP2 (0) ~ {B C p\(i) + ^CP 2 (0} 

£> dijf V ) 


Subsequently, the root-mean-square (RMS) waveforms for each hemisphere were calculated from the 
differential magnetic fields. 


RESULTS 

The individual and total RTs in all subjects are summarized in Figure 2. The main effect of 
performed tasks showed significance (F(l,7) = 123 ,p < 0.01), whereas that of delivered stimuli 
(W) < 1) and their interaction (/ 7 '( 1,7) < 1) were not significant. As for the MEG data, root- 
mean-square (RMS) differential waveforms obtained by subtracting the averaged waveforms for 
IPs from those for CPs in a typical subject who does not have absolute pitch are shown in 
Figure 3a. Although no early component was observed under the non-prediction condition, the 
component over Nlm, named Dl, was recognized in the period of 100-250 ms bilaterally under 
the prediction condition. Seven out of eight subjects showed Dl under the prediction condition. 

The mean peak latencies and the corresponding RMS values of Dl was 138 ms and 55 fT. 

Figure 3b shows an iso-contour map corresponding to Dl, indicating that Dl represents the 
activity in the vicinity of the primary auditory cortex (Al). 

DISCUSSION 

The RT measurements proved that subjects could detect the left and right audio-visual 
discordances evenly, and that a process facilitating the detection of discordances, preparing for 
the presentation of a sound, exists. 

The results of the MEG data suggest that the component Dl under the prediction condition is 
similar to the MMF in that it was observed over Nlm and the peak latencies of Dl preceded 
those of Nlm in the vicinity of Al. In contrast with the case for MMF, it is elicited under the 
prediction condition, but not elicited under the non-prediction condition, which is similar to the 
component recognized in our previous study. Moreover, absolute pitch is irrelevant to the 
elicitation. Since the occurrence frequencies of CPs and IPs were adjusted to be equal, the 
elicitation of Dl is not due to the frequencies or the past presentations of stimuli, suggesting that 
a memory-trace process for crossmodal information cannot exist. Consequently, the vision-based 
prediction seems to create the template explicitly, which is compared with the incoming sound to 
elicit Dl. Although the effect of top-down control on the mismatch system is controversial 
[Rinne, 2001], this template might be identical with that of MMF. By exploiting the differences 
of information levels being processed, the incongruous audio-visual information is detected in the early auditory information processing without any 
direct retention of the higher-order visual information in the auditory sensory memory. 
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Figure 3. (a) RMS differential waveforms 
obtained by subtracting the magnetic fields 
for IPs from those for CPs. (b) An iso¬ 
contour map corresponding to the component 
Dl observed under the prediction condition. 
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ABSTRACT 

Automatic auditory processing of two-voice polyphonic music in musicians and musically naive subjects was investigated using the mismatch 
negativity (MMN) response derived from MEG recordings. Deviant notes occurred either in the high or in the low voice, and went either outside the 
key of the melody or remained within the key. MMN obtained from musicians for all conditions were larger than in controls. Responses were larger 
for changes in the higher voice and for within-key changes. The results suggest that polyphonic melodies are encoded independently in auditory 
sensory memory, but that the representation of the higher voice melody is more robust. 

KEY WORDS 

Mismatch negativity, MEG, auditory memory, melody encoding, polyphonic music. 

INTRODUCTION 

Western polyphonic music contains multiple simultaneous melodic voices of equal importance. Previous studies have shown that a single melody 
is encoded automatically, as indexed by the mismatch negativity (MMN) in electroencephalography (EEG) [Trainor, 2002]. Using magneto¬ 
encephalography (MEG), larger MMNm was found in musicians than non-musicians [Fujioka, in press] suggesting that musical experience enhances 
the formation of melodic memory traces. The MMNm component was used to investigate whether multiple simultaneous melodies in polyphonic 
music are encoded as independent auditory memory traces or whether they are encoded as a single percept in musicians and non-musicians. 

METHODS 

Ten musicians (5 females, age 20-35) having more than 10 years of formal 
musical education and ten non-musicians (4 females, age 23-34) with no 
musical training outside regular school education participated in the study. 

Polyphonic stimuli consisted of two simultaneous melodies (A and B), each 
composed of five 300-ms piano notes (Fig. 1). In the HALB condition, melody 
A was higher than melody B, while in the HBLA condition B was higher than 
A. An oddball paradigm was used to investigate MMNm. In each condition, 

50% of the melodies ended with the standard terminal (Fig. 1). Four deviant 
trial types (each 12.5%) were presented in which the deviant note was either in 
the high voice (HD) or the low voice (LD), and either in key (1) or out of key 
(2) to create the HD1, HD2, LD1, and LD2 deviants (Fig. 1). In each condition 
500 trials were binaurally presented separated by 900 ms inter-melody interval 
at 60 dB above individual sensation threshold. 

Magnetic fields were recorded with a 151-channel whole-cortex 
magnetometer (OMEGA, CTF Systems Inc, Port Coquitlam, Canada). The 
MEG signals were band-pass filtered between 0.1 and 100 Hz and sampled at a 
rate of 312.5 s' 1 . The duration of a recording epoch was 2.35 s including a 0.4-s 
pre-stimulus interval. Magnetic field data were averaged selectively for 
standard and deviant types in each condition after rejecting eye-artifact 
contaminated epochs. The analysis technique of source space projection (SSP) 
was applied to the MEG data, which combined the multi-channel magnetic 
field data into a single time-series of magnetic dipole moment. Grand averaged 
source waveforms across each groups of subjects were obtained for the 
standard and deviant stimuli, and for the difference between these two. The baselines were adjusted to the mean in a 100 ms interval prior to the onset 
of the deviation. The 99% confidence intervals for the grand averaged waveforms were estimated from nonparametric bootstrap re-sampling analysis 
to identify time intervals with amplitudes significantly different from zero. The individual MMNm amplitudes were derived as mean value within a 
40-ms interval around the peak latency of grand average. A repeated measured ANOVA was applied for statistics analysis MMNm amplitudes 
(between-subject factor: group, within-subject factor; melody-voice combination, voice, and tonality). 

RESULTS 

MMNm was present in musicians for all types of deviants, whereas the MMNm in non-musicians did not reach significance for some out-of-key 
conditions (Fig. 2, 3). MMNm was significantly larger in musicians than in non-musicians (P<0.05). The MMNm was larger for in-key than for out- 
of-key deviations across groups (P<0.01). The interaction between voice and group was significant (P<0.05) with the high voice MMNm larger than 
the low voice MMNm in musicians (P<0.05) while non-musicians did not show this distinction. In the mean, the MMNm latencies were longer for 
low deviants (160 to 230 ms) than for high deviants (135 to 185 ms) in both groups. Although no main effect of lateralization was found across 
groups, musicians had larger MMNm for tonality violation in the left hemisphere and controls had larger MMNm in the right hemisphere (P<0.05). 
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Figure 1. The two melodies (A and B) are presented 
simultaneously in two melody-voice combinations (HALB and 
HBLA) with four types of deviant terminals for each 
polyphonic condition varying voice and tonality (HD1: high, 
in-key change; HD2: high, out-of-key; LD1: low, in-key 
change; LD2: low, out-of-key). 
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DISCUSSION 

The larger MMNm in musicians than in non-musicians supports the idea 
that long-term musical training enhances automatic auditory memory traces for 
melodies. Furthermore, the different lateralization between the two groups to 
tonal violations is likely related to experience-induced change from 
predominantly right [Patel, 1998] [Peretz, 1979] [Tervaniemi, 2000] to left 
hemispheric dominance in harmonic processing. MMNm was generally larger 
and earlier for changes in the high than in the low voices; although the small 
amplitude of the MMNm in non-musicians did not allow this trend to reach 
significance, the latency effect was present in both groups. This suggests that a 
more robust memory trace is formed for the higher melody, in agreement with 
the behavioural studies showing superior recognition of higher melody change 
in both musicians and non-musicians [Crawley, 2002]. 

It is somewhat puzzling that the in-key change produced larger MMNm in 
both groups compared to out-of-key change despite superior behavioural 
detection of the latter. However, the lack of larger MMNm for out-of-key than 
within-key changes is consistent with previous findings [Trainor, 2002], and is 
likely due to the physical interval size difference whereby the in-key change 
was a whole tone and the out-of-key change was a semitone (Fig. 1). This 
suggests that MMNm is very sensitive to the size of pitch differences. 

Perhaps of most interest is the fact that MMNm was found for changes in 
both high and low voices despite the fact that changes occurred on 50% of 
trials, supporting the idea that separate memory traces are formed for the two 
simultaneous melodies. In general, when a standard occurs 50% of the time and 
two deviants each 25% of the time, no MMNm is found. We confirmed this in 
a separate study [Fujioka, in prep.] in which a single tone (50% of trials) and 
two deviant tones (25% each) occurred. An enhanced N1 was found for 
deviants, but no MMNm. The fact that MMNm did occur in the present study 
suggests that the melodies were encoded in separate traces and that each 
memory trace was affected by stimulus probabilities within its own stream. 
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HALB condition Left Hemisphere Right Hemisphere 



Figure 2: The grand-averaged source waveforms of MMNm 
from musicians and non-musicians in the HALB condition. 
The time scale refers to the onset of the 5th note. Thick lines 
represent the median (n=10) of the MMNm response and thin 
lines the 99% confidence limits for estimated baseline noise. 
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Figure 3: Group results obtained in the HBLA condition 
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Analysis of modulation of MEG during auditory attention to frequency 

by frequency-tagged neuromagnetic responses 

Hasegawa T, Owaki T, Amano K, Takeda T 

Dept, of Complexity Science and Engineering, Graduate School of Frontier Science, The University of Tokyo, Japan 

ABSTRACT 

Modulation of neuromagnetic fields while listening to sounds with different auditory attentions was investigated. Auditory stimulus was 
continuous sound that consisted of high and low tone. Each tone was amplitude-modulated by different frequencies. Neuromagnetic fields were 
recorded by a 440-channnel MEG system while subjects were listening to the stimulus with both ears. There were three attentional conditions; Attend 
High, Attend Low, and control condition. In Attend High condition subjects attended to only high tone of stimulus, and in Attend Low condition they 
attended to only low tone, and they attended to neither of them in controll condition. MEG data was analyzed with power spectra and it was found 
that a large number of channels had peaks at both modulation frequencies. Distribution of channels that showed peaks looked like one that made by 
magnetic sources in the neighborhood of auditory cortices of both hemispheres. The two peaks were used as tags for the responses to each tone and 
the changes of their power strength by auditory attention change was analyzed. At the tag frequency of the low tone, peaks were larger in Attend Low 
condition than in Attend High condition. Though that result was consistent with our anticipation, such result couldn’t be obtained at the tag frequency 
of the high tone. More verification is needed in consideration of tonotopic organization in auditory cortex. 

KEY WORDS 

Magnetoencephalography, Auditory cortex, Attention, Amplitude Modulation, Steady-state fields, Power spectrum. 

1. INTRODUCTION 

We can distinguish the elements of complex sounds each other, even if they reach our ears at the same time. When one of them is important and 
the others aren’t, we attend to only the important one. As a result, we perceive the attended element is enhanced, and the others fade out. Like this, 
our perception can change by attention, even if the input sounds don’t change. 

There’re many studies of the mechanisms of selective attention in human brain. Magnetoencephalography (MEG) has been used as one of the 
effective measurement methods in its non-invasiveness and high time resolution. One of the studies with MEG was reported that event-related fields 
(ERFs) were enhanced by attention [Woldorff, 1993]. This result seems to consistent with the change of our perception. By the way, in the most of 
reports including the previous one, subjects attended to the auditory stimulus in the specific location (left/right). As we know, we can attend to the 
pitch of sounds. 

In this study, MEG was recorded and analyzed in order to investigate a relationship between changes of perception by the effect of attention and 
neuromagnetic responses. The auditory stimulus consisted of two tones that had different pitch. Subjects were instructed to attend to higher tone, 
lower tone, or neither of them in each measurement. To distinguish responses to each tone, they were amplitude-modulated by different frequency. 
This method used a property that a sharp peak appeared at modulation frequency in power spectrum of MEG while amplitude-modulated sounds 
were inputted to ears. 

2. METHODS 

2.1 Subjects 

Four subjects were participated in this study. All of them were right-handed and male. They were 23-28 years old, and had no history of 
neurological disorders. 

2.2 Stimulation 

Auditory stimulus consisted of two tones. One of them was 3000Hz tone amplitude-modulated by 
40.1Hz (high tone), and another was 700Hz tone modulated by 30.1Hz (low tone). The depth of their 
modulation was 80%. Several combinations of carrier and modulation frequencies were tested before this 
study. In consideration of results of the test and some previous studies [Pantev, 1996. Fujiki, 2002], the 
carrier frequencies and the modulation frequencies were chosen. Subjects were listening to the stimulus 
with both ears through plastic tubes. Sound intensity was 60-70 Sound Pressure Level. 

2.3 Data acquisition 

Neuromagnetic fields were recorded by a 440-channel whole-head MEG system (Yokogawa Electric 
Corporation, Japan) in a magnetically shielded room. There were three attentional conditions. Subjects 
were instructed to attend to high tone in the Attend High condition, and to attend to low tone in the Attend 
Low condition. In addition, in control condition, they were listening with attending to neither of them. 

MEG data were recorded for 300 seconds in each condition. Sampling frequency was 1000Hz and band¬ 
pass filtered (0.3 - 500Hz). Auditory stimulus was sounded continuously during each measurement. 

2.4 Data analysis 

Fast Fourier transform (FFT) spectra were calculated across 8192 samples. FFT window was moved in steps of 4096 samples and 72 power 
spectra were averaged. The strengths of power at each modulation frequency were used for tags of each tone. Changes of each tag’s power caused by 


power 



Fig. 1. A drawing that shows 
“Pt” and “Pb”. 
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differences in attentional condition were investigated. To select the channels that were influenced by the auditory inputs, the ratios Pt/Pb at each tag 


frequency were calculated in each channel. “Pt” means the power of the 
top of the peak, and “Pb” means the power of the base of peak (see 

Fig-1)- 

First, we analyzed the peak at the tag frequency of low tone (30.1Hz). 
The channels whose Pt/Pb was larger than 1.3 in control condition were 
selected as the channels that influenced by the auditory stimulus. Pt/Pbs 
in Attend High condition of selected channel were averaged. So were 
Pt/Pbs in Attend Low condition. Pt/Pbs in Attend High and Attend Low 
were normalized by those in control condition. Secondly, the peak at the 
tag frequency of high tone (40.1 Hz) was analyzed in the same way. 

The value of Pt/Pb was mainly analyzed rather than strength of the 
peak, because there was the possibility that the strengths near the peak 
were decreased. 

3. RESULTS 


power[tT'2./Hz] 


Power spectra of some channels were showed clear peaks at each 
modulation frequency (Fig.2). The peaks at the tags of low tone were 
larger than those of high tone in all subjects. 

One representative distribution of Pt/Pbs at modulation frequency 
(30.1Hz) was showed in Fig.3. The lighter area means the larger Pt/Pbs (sharp peaks). 

The averaged Pt/Pbs were showed in Fig.4 with the average of four 
subjects. The gray bars are in Attend Low, and the white bars are in 
Attend High condition. At the tag frequency of low, all the subjects 
showed some increases in Attend Low condition. 

4. DISCUSSION 

In this study, we presupposed that the power strengths of each 
modulation frequency would change according to the difference of tone 
which subjects attended. The channels that had a sharp peak (large Pt/Pb) 
were distributed like two magnetic sources existed in the neighborhood 
of bilateral auditory cortex (Fig.3). Thus, it was able to think that the 
channels that had a sharp peak at the tag frequency were influenced by 
the auditory stimulus. Therefore, only such channels were selected and 
averaged (Fig.4). At the tag frequency of low tone, the Pt/Pbs were 
bigger in Attend Low condition than Attend High condition. On the other 
hand, two of four subjects showed larger ratio in Attend High condition 
than in Attend Low condition. 

At the low tone tag, the results were consistent of our anticipation. 

Tonotopic organization had some probability for the reason that the 
anticipated results couldn’t be obtained. In human auditory cortex, 
sources for the higher frequency represented more deeply with in the 
supratemporal sulcus. So magnetic field amplitude made by auditory 
inputs with higher frequency are weaker than with lower frequency 
[Pantev, 1996]. It is obvious in Fig.2. So it was probable that accurate 
responses to high tone couldn’t observe than responses to low tone. It is 
needed to retry in consideration of that point. 



Fig.2. A power spectrum of one representative 
channel (Subject CL 



(b) 


Fig.3. A distribution of ratio of top to bottom of the 
peak at 30.1 Hz (control condition), (a): Two- 
dimensional map. (b): View from left side of head. 
The lighter area means larger ratio. 
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30.1 Hz (Low tone) 


Ave. 
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40.1 Hz (High tone) 


Ave. 


□ Attend High 

□ Attend Low 


Fig.4. Comparison of the averaged ratios of each 
subjects. Left: at 30.1Hz, Right: at 40.1Hz. Ave. 
means average of four subjects. The values are 
normalized with control condition in each channel. 
Error bars refer to SEM of each condition in each 
subject. 
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The Influence of Sleepiness on the Auditory Evoked Magnetic Field 

Natsuko Hatsusaka, Masanori Higuchi 
Applied Electronics Laboratory, Kanazawa Institute of Technology, Japan 


ABSTRACT 

Although there have been many studies of the auditory magnetic field in a state of wakefulness, there were few studies about the influence of 
sleepiness. In this study, we measured the auditory evoked magnetic fields in wakeful and half-sleep conditions, and discussed the influence of 
sleepiness. Twelve subjects with normal hearing ability (eight men, four women) participated in this study. Magnetic fields were recorded while 
subjects lay down inside a magnetically shielded room using a 320 channel whole-head MEG system. Auditory stimulation was 1000Hz tone burst of 
50ms duration. The responses were averaged by 100 stimuli. A significant response was observed after the Nlm response in a half-sleep condition. 
We obtained the similar results from 6 of the subjects. This response was estimated to come from the deep area (around cingulate gyrus). We assume 
it has some relation to the theta oscillation, and is synchronized with auditory stimulation. 

KEY WORDS 

MEG; Sleepiness; Auditory evoked magnetic field; Nlm; P2m; 

INTRODUCTION 

There have been many studies of the auditory evoked magnetic field. Most of the studies show Nlm and P2m responses are typically observed in 
a wakeful condition. The magnetic fields of these responses are distributed in superior temporal cortex. From these fields, the sources are estimated in 
primary auditory cortex. However it is not clear how the auditory evoked magnetic field is affected by sleepiness. In this condition, subjects feel very 
sleepy and keep on fighting with sleepiness, and sometimes have dream-like feelings. We define this condition as a half-sleep condition. In this study, 
we measured the auditory evoked magnetic fields in wakeful and half-sleep conditions, and discussed the influence of sleepiness. 

METHODS 

Twelve subjects (eight men, four women) with normal hearing ability participated in this study. Informed consent was obtained from each subject 
before the experiment. Auditory stimulation was 1000Hz tone burst of 50ms duration and delivered to the right ear through an air tube attached to an 
earplug. We used 3 interstimulus intervals of 1000ms, 1500ms and 2000ms. Auditory evoked signals were averaged by 100 stimuli. In order to make 
a half-sleep condition, the measurement was started about 1 hour after having lunch. 

Magnetic signals were recorded in a magnetically shielded room using a 320-channel whole-head MEG system developed by Kanazawa Institute 
of Technology [Higuchi, 2003]. Subjects lay down on the bed, and were instructed to stare at a fixed point. After the measurements, we asked the 
subjects about their sleepiness during the measurements. 

RESULTS 

We found clear responses in both wakeful and half-sleep conditions as 
shown in Figure 1. Especially the second peak in the half-sleep condition 
had very large amplitude. It was not observed in the wakeful condition. We 
confirmed this peak was not an artifact caused by eye movement. We named 
this peak “BIKKURI response”. BIKKURI is a Japanese word and means 
surprising or waking up. We obtained the similar results from 6 of the 
subjects. As shown in Figure 2, the BIKKURI responses were clearly 
observed in the raw data, which was not averaged by auditory stimuli. In 
this figure, vertical dash lines show auditory stimulus onsets. We confirmed 
the BIKKURI response was synchronized with auditory stimulus and was 
not magnetocardiogram. The isofield contour map of the first peak in the 
wakeful conditon was almost the same as the map of the first peak in the 
half-sleep condition, which shows a typical field pattern of the Nlm 
response. This means the source of the Nlm response doesn’t change by 
sleepiness. Figure 3 shows the isofield contour maps of the second peak in 
the wakeful condition and the BIKKURI response. The field patterns were 
clearly different. The map of the wakeful condition is similar to the reversed 
pattern of the Nlm response and confirms this peak is the P2m response. On 
the BIKKURI response, the map suggested the sources were located in 
different area than the hearing cortex. Actually, the dipole sources were 
estimated in the vicinity of the cingulate gyrus as shown in Figure 4. We 
also observed the BIKKURI response with the interstimulus interval of 
1500ms. Therefore, the BIKKURI response does not seem to be influenced 
by interstimulus interval. 

DISCUSSION 

Chiara M. Portas et. al reported that the N1 response of the auditory 
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Figure 1. Averaged waveforms of auditory evoked 
magnetic fields in wakeful and half-sleep conditions. 
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evoked potential wasn’t influenced by sleep [Chiara, 2000]. In our study the Nlm response also didn’t change by sleepiness. 

We observed the significant response, which was named “BIKKURI response”, after the Nlm response in a half-sleep condition. The sources 
were estimated in the deeper area than the auditory cortex. Asada et. al reported the current sources of the theta rhythm activities observed in sleeping 
condition were estimated in the deep brain area [Asada, 2000]. Their study suggests the BIKKURI response have some relation to the theta rhythm 
activities. Generally, the theta rhythm activities are spontaneous, and are not generated with regularity. Therefore, the components of the activities 
should be distributed uniformly in a recording window by means of averaging process. However, our results showed they were not distributed 
uniformly, and observed at certain latency significantly. It is difficult to identify the BIKKURI response as a part of theta rhythm activities using 
frequency analysis. 

We classified epochs of the raw data in a half-sleep condition under two groups. One is the BIKKURI response appeared clearly, the other is it 
was unclear. We averaged the data and compared the Nlm responses of the two groups. The former group was larger than the latter group. This 
suggests awareness level of the epoch in which the BIKKURI response appeared was higher than the other epoch in the half-sleep condition. The 
BIKKURI response may be a trigger to awake the brain. 
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Figure 2. Raw MEG date with auditory stimuli in a half-sleep condition. 



Figure 4. Dipole estimation of the BIKKURI response observed in a half-sleep 
condition. 
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Figure 3. Isofield contour maps of the 
second peak in a wakeful conditon (a) 
and the BIKKURI response (b). 
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ABSTRACT 

In a recent EEG investigation [Johnson et al, 2001], we described a novel late negative ERP component associated with binaural processing of 
auditory pitch based solely on interaural timing differences (“dichotic pitch”), an acoustic phenomenon that is closely analogous to visual perception 
of stereoscopic depth based on retinal disparities. The present study extends this research with neuromagnetic recordings of auditory evoked fields 
(AEFs) elicited by dichotically-embedded pitches. Eight healthy adult subjects listened to control stimuli consisting of 500 ms segments of 
broadband acoustic noise presented identically to both ears via earphones, and dichotic pitch stimuli created by introducing a dichotic delay to a 
narrow frequency region of the same noise segments and resulting in a perception of a pitch lateralized to the left or right of auditory space. Auditory- 
evoked fields (AEFs) were recorded using a 151 channel whole-head MEG system. Comparison of control and dichotic-pitch AEFs showed reliable 
amplitude differences during a time window of 150-350 ms. AEFs over the left hemisphere showed larger effects for contralateral than ipsilateral 
pitches, while the right hemisphere showed no differences for differently lateralized sources. The results indicate a relatively late stage of neural 
processing of binaurally-derived cues for the perceptual segregation of concurrent sound sources and support a right-hemisphere dominance for the 
processing of sound-source localization. 

KEY WORDS 

Auditory perception; Acoustic stimulation; Brain mapping; Evoked potentials, auditory; Magnetoencephalography; Pitch perception; Sound 
localization. 


INTRODUCTION 

“Dichotic pitch” is a psychoacoustic illusion involving the perception of pitches from broadband noise stimuli that contain no monaural cues to 
pitch. When listeners are presented with two noise processes with interaurally-identical amplitude spectra, but with a specific interaural lag over a 
narrow frequency band, the dichotically-shifted frequencies will be perceptually segregated from the noise, and the resulting pitch has a tonal quality 
associated with the center frequency of the dichotically-delayed portion of the spectrum. The perceived location of the pitch is determined by whether 
the interaural lag occurs in the left or the right ear. This illusion provides a convenient experimental protocol for the study of auditory mechanisms 
involved in the perceptual segregation and spatial localization of concurrent sound sources based on interaural timing cues. 

In the present study we measured acoustically-evoked brain responses with magnetoencephalography. Our objectives were to: (1) identify neural 
responses associated with the perception of dichotic pitch; and (2) determine if these responses exhibit any hemispheric lateralization corresponding 
to the perceived location of the dichotic pitch. 

METHODS 

Eight healthy right-handed adult subjects (six males; mean age 35, range 24-45) participated in the experiment. Control stimuli were two 
independent 500 ms broadband Guassian noise processes of 500 ms duration. Dichotic pitch stimuli were produced using a complementary filtering 
technique [Dougherty et al., 1998]. A dichotic delay of 0.45 ms was introduced to the noise processes for a frequency band of 575-625 Hz, with the 
temporally-advanced stimulus presented to either the right ear (producing a pitch percept lateralized to the left) or the left ear (producing a percept 
lateralized to the right). Stimuli were generated on two channels of a 16-bit A-D converter (National Instruments Corp, Austin, Texas USA) PCI 
6052E) and delivered with insert earphones (Etymotic Research Inc., Elk Grove Village, Illinois USA) at a level of 70 dB SPL. Each of the three 
stimulus types (control, right-lateralized pitch, left-lateralized pitch) was presented in eight blocks of 32 trials, for a total of 256 trials per stimulus 
type. Within a block stimuli were presented at a random interstimulus interval with limits of 900 and 1100 ms. Blocks of each stimulus type were 
randomly interleaved during the session. 

Whole-head MEG recordings were made using a 151-channel magnetometer (CTF Systems, Inc., Port Coquitlam, Canada). Data were sampled 
at 625 Hz with a 0-200 Hz bandpass filter. Each trial was 800 ms duration, including 300 ms prior to stimulus onset. Averaged evoked magnetic 
fields were filtered with a bandpass of 1-30 Hz. 

RESULTS Left Temporal Right Temporal 


Compared to responses evoked by control 
stimuli, both types of dichotic pitch stimuli evoked 
responses with a prominent decrease in amplitude 
during a time window of about 150-350 ms (Figure 
1). The timing of the dichotic pitch effect concurs 
with effects observed in event-related potential 
recordings using the same acoustic paradigm 
[Johnson et al., 2003]. To test for statistical 
significance of these effects, AEF amplitudes during 
the 150-350 ms time window were rectified and 
averaged over 42 sensor positions over the temporal 
region in both hemispheres. ANOVAs confirmed 



Control 

Pitch 



Figure 1. AEFs evoked by control and dichotic pitch (left lateralized) stimuli for subject 3. 
Time scale on horizontal axis is 100 ms/div. Amplitude scale on vertical axis is 50 fT/div, 
positive flux is upward. 
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that both types of pitch ERFs had significantly decreased amplitudes in comparison to control ERFs (p < 0.01 for both). 

To examine hemispheric differences in processing of dichotic pitch, rectified AEF 
amplitudes for the 150-350 ms time window were averaged over 21 sensor positions over the 
temporal region in each of the two hemispheres and mean values were expressed as per cent 
changes from the control condition. Figure 2 shows that, for the left hemisphere, changes were 
significantly greater for contralateral than ipsilateral stimuli (F(l,7) = 11.4, p = 0.01). In 

contrast, contralateral/ipsilateral differences were not significantly different for the right 
hemisphere (F(l,7) < 1). 

DISCUSSION 

The present results confirm that magnetic recordings index a relatively late (150-350 ms) 
stage for neural processing of the dichotic pitch stimuli. Since the control and pitch stimuli 
used here were discriminable solely by a dichotic delay in the dichotic pitch stimulus, we can 
be confident that this response reflects neural processing that is dependent upon binaural fusion 
within the auditory system. The relatively late-latency consequences of binaural processing 
obtained in the present study are consistent with results obtained with event-related potential 
recordings using comparable stimuli [Johnson et al., 2003]. This result supports the conclusion 
that the magnetic and electrical recordings are indexing a comparable phase of information 
processing - a relatively late stage of auditory processing that is dependent upon binaural 
fusion of ITDs within the auditory system. An important point of difference, however, is that 
the magnetic responses to dichotic pitch are manifest as a decrease in surface field amplitudes, 
presumably reflecting a decrease in neural activity, while the ERP responses were manifest as 
an increase in negativity, presumably reflecting an increase in neural activity. This dissociation indicates that the electrical and magnetic recordings 
may be sensitive to different configurations of brain mechanisms involved in processing dichotic pitch, and thus may provide complementary rather 
than redundant information about these sources. 

The MEG results indicate that the two hemispheres respond differently to the location information available in the pitch stimuli: the left 
hemisphere exhibited a greater response to contralateral than to ipsilateral stimuli, while the right hemisphere responded equally to both contralateral 
and ispilateral stimuli. This result parallels findings from a MEG study of mismatch responses to simple linguistic stimuli [Kaiser et al., 2000], 
showing that left hemisphere responses have significantly longer latencies to ipsilateral stimuli, while right hemisphere responses have equal 
latencies for both contralateral and ipsilateral stimuli. Taken together, these findings and the present results support the interpretation that the right 
hemisphere may be dominant for processing of sound-source localization. 

Because the dichotic pitch illusion results in the perception of two concurrent sound objects - a centrally-located noise and a lateralized pitch - 
this phenomenon is relevant to current investigations of brain mechanisms involved in the perceptual segregation of sound sources [Bregman, 1990]. 
In particular, the neural processing described here and in Johnson et al. [2003] resembles the object-related negativity (ORN) elicited at a comparable 
latency when subjects perceive a mistuned harmonic as a separate perceptual object from a complex sound composed of multiple harmonics [Alain et 
al, 2002]. The present results indicate that dichotic pitch may be a useful paradigm for the study of hemispheric specializations that may be involved 
in perceptual segregation of concurrent sound sources. 



Flemisphere 

Figure 2. Mean change (from control) in amplitude of 
evoked magnetic fields as a function of perceived 
stimulus laterality and hemisphere. 
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Enhanced P2m Responses to Musical Tones in Musicians 

Kanda S., Hirata, Y., and Kuriki, S. 
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ABSTRACT 

We recorded neuromagnetic responses from 10 musicians and 10 non-musicians (right-handed males, 19-26 years old). Stimuli were 19 single 
piano tones and 12 major chords with a duration of 625 ms. Five identical tones/chords were successively presented (total duration 3125 ms) as a 
stimuli in each trial and offset-to-onset intervals between the trials varied 1000-1050 ms. Nlm and P2m were elicited by the first tone/chord. The 
P2m amplitude was larger for the musicians than for the non-musicians. The P2m amplitude showed a sound-order effect: it was larger for the 1st 
tone/chord than for the rest of tones/chords. This sound-order effect was significantly smaller for the musicians than for the non-musicians: a 
distinctive P2m was observed for all the successive tones/chords in the musicians, whereas the P2m was greatly reduced for 2nd to 5th tones/chords 
in the non-musicians. Moreover, this sound-order effect was smaller for the chords than for the tones and was also smaller in the right than the left 
hemisphere in the musicians. Such differences in the sound-order effect between the hemispheres and between single tones and chords were not 
observed for the non-musicians. 

KEY WORDS 

Auditory cortex, neural plasticity, musical training, musical chords, piano tone, hemispheric difference 

INTRODUCTION 

Physiological and anatomical differences in the auditory cortex between musicians and non-musicians have been reported for a decade [Hirata, 
1999][Pantev, 2001][Schlaug, 2001]. A recent EEG study by Shahin et al.(2003) reported that P2 elicited by pure, piano and violin tones was larger 
for musicians than for non-musicians. Further, the P2 was much larger for the piano and the violin tones than for the pure tones in the musicians. The 
differences in the P2 amplitude between the musicians and the non-musicians were thus much larger for the piano and the violin tones than for the 
pure tones. In contrast to these characteristic of P2, the vertex N1 did not differ between the musicians and the non-musicians. This finding indicates 
that P2 rather than N1 is correlated with highly trained music abilities in musicians. In addition, the P2 amplitude was increased in non-musician 
subjects by laboratory trainings using speech sounds [Tremblay, 2001][Atienza, 2002]. 

In the present study we compared neuromagnetic responses to chords between musicians and non-musician. A chord consists of several notes of 
different pitches separated by specific pitch intervals. When chords are played in sequence, the notes on their top constitutes a melody. The rest of the 
notes with lower pitch function as accompanying bass. Musicians can process tunes in a sensitive way when a musical sequence is played in chords 
than in isolated tones because internal representation of chords and chord progression is available. They can also attend all notes played 
simultaneously in chords. In contrast, non-musician do not have such internal representation and they can attend only a part of the note in chordal 
sequence; i.e., melody line. Hence it is presumed that, while musicians process chords and single notes in different ways, non-musicians process them 
rather equally. Given that the P2 component is more sensitive to acoustic complexity in musicians than in non-musicians [Shahin et al., 2003], P2m 
would be larger for chords than for isolated tones in musicians but not in non-musicians. In order to examine this hypothesis, we recorded the 
magnetic counter part of P2, P2m from musician and non-musicians. 

METHODS 

Subjects were 20 right-handed males (19-26 years old). Ten of them had started piano lessons before age of 7 (musicians) and have been 
continuing music activities since then. The rest of the subjects did not have any experience of music training (non-musicians). The stimuli were 19 
single piano tones (A2 to E flat 4 separated by semi tone) in a tone session and 12 chords (major triads, the root of each triad ranged from A2 to A flat 
3 separated by semi tone) in a chord session. In each trial, five identical sounds were successively presented for a total duration of 3125 ms and these 
stimuli were presented at offset-to-onset intervals of 1000-1050 ms both in tone and chord sessions. The duration of each sound was 625 ms. The 
stimuli were bilaterally presented at an intensity level of 67-70dB SPL(A). To ensure that subjects pay attention to the stimuli during the sessions, 
they were asked to count covertly rare oddball stimuli consisting of one deviant-pitch sound and four identical sounds. The rare oddball stimuli were 
inserted at a probability of 10% within standard stimuli constisting of the five identical sounds. Neuromagnetic responses elicited by each sound of 
the stimuli were recorded using a 204-channel whole head system (Neuromag Inc, Helsinki). The responses to the single tones and the chords were 
separately averaged. The responses to the oddballs were excluded. The mean value of the signal in a 500-ms period prior to the onset of the first 
sound was used as the baseline in each channel. The band pass filter was set at 1-20 Hz. Epochs with signal variations exceeding 4.0 pT in amplitude 
were excluded from averaging. The number of averaging was more than 300 epochs for each session. 

RESULTS 

Fig 1 shows averaged neuromagneitc responses for five successive single tones and successive chords recorded from the right hemisphere of a 
musician and a non-musician. Nlm and P2m peaks were distinctively observed for the first sound in both subjects. Both Nlm and P2m were greatly 
reduced in amplitude for 2nd to 5th sounds in a non-musician, while the P2m was well preserved in the musician. We selected 26 channels centered 
over the auditory cortex where a P2m response to the first sound was larger than the other channels in each hemisphere. These responses were root- 
moon-squared (RMS) to represent the magnitude of the P2m. Fig. 2 shows the grand mean RMS for each hemisphere of the musicians and of the 
non-musicians. The musicians showed P2m for all the five sounds, whereas the non-musicians showed a clear P2m only for the first sound. A 4-way 
ANOVA was performed for the RMS amplitude at P2m peak with factors of group (musicians/non-musicians), sound (single tone/chord), hemisphere 
(left/right) and sound order (1st to 5th). The main effects of group (p<0.005), hemisphere (p<0.05) and sound order (p<0.0001) were significant. 
Post-hoc tests using Scheffe procedures revealed that the amplitude for the 1st sound was larger than those for the other sounds. In order to evaluate 
the sound-order effect, we normalized the RMS amplitude of the P2m for 2nd to 5th sounds by that of 1st sound. A 4-way ANOVA was performed for 
the normalized data with factors of group (musicians/non-musicians), sound (single tone/chord), hemisphere (left/right) and sound order (2nd to 5th). 
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The main effects of group (p<0.05), sound (p<0.005) and hemisphere (p<0.01) were significant. The interaction between group and sound order was 
also significant. A subsequent post-hoc 3-way ANOVA with factors of sound, hemisphere and sound order was separately performed for the data of 
each subject group. The main effects of sound (p<0.005) and hemisphere (p<0.05) were significant for the musicians. In contrast, these effects were 
not significant for the non-musicians. 

DISCUSSION 

In the present study we observed differences in P2m amplitude between the musicians and the non-musicians in cortical magnetic responses to 
musical sounds. In accordance with an EEG study of Shahin et al.(2003), the P2m was larger for the musicians than for the non-musicians. P2m 
showed a sound-order effect: it was reduced for 2nd to 5th sounds compared with the 1st sound. For the musicians, the P2m showed a less sound- 
order effect. Moreover, the sound-order effect was smaller for the chords than for the single tones and also was smaller in the right than in the left 
hemisphere in them. In contrast, an amount of the sound-order effect between the hemispheres and between the single tones and the chords did not 
differ in the non-musicians. The observed robustness of the P2m elicited by the chords in the right hemisphere suggests that the neural activities 
reflected by P2m might be related to processing of acoustically complex sounds elaborated by long-term musical training with high specificity. 
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Fig.2 Grand mean RMS waveforms for the musicians and non-musicians. 


Table 1 The RMS 


amplitude of the P2 peak normalized by the P2m amplitude to the 1 st sound. 


Group 

Sound 

Hemisphere 

1st 

2nd 

Order 

3rd 

4th 

5 th 

Musician 

Chords 

Left 

1 

0.72 

0.72 

0.69 

0.67 



Right 

1 

0.95 

0.97 

0.99 

0.87 


Single tones 

Left 

1 

0.59 

0.60 

0.55 

0.51 



Right 

1 

0.74 

0.73 

0.76 

0.68 

Non-Musician 

Chords 

Left 

1 

0.47 

0.43 

0.47 

0.48 



Right 

1 

0.60 

0.55 

0.60 

0.51 


Single tones 

Left 

1 

0.47 

0.40 

0.41 

0.43 



Right 

1 

0.56 

0.50 

0.51 

0.45 


Fig. 1 Averaged cortical magnetic responses from 
the right hemisphere of a musician and a non-musician. 


510 





































P4-1 


Auditory neural activity during period of vertigo induced by cold caloric stimulation: 
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ABSTRACT 

The aim of this study was to investigate neural activity during period of vertiginous sensation, induced by caloric stimulation. My question is that 
MEG pattern in a patient with vertigo (rotatory sensations) has the same abnormality as dizziness (non-rotatory sensations). We tried caloric 
vestibular stimulation (CVS) by cold water of five volunteers before the MEG measurement. Auditory evoked magnetic fields (AEFs) during the 
subsequent period of vertiginous sensations were measured by magnetoencephalography. Current-arrow maps (CAMs) were produced to estimate the 
rotation degree in CAM. We found that rotation value (dirot) is lower than those of dizzy patients. These findings suggest that there is a different 
mechanism between dizzy and vertigo. 

KEY WORDS 

Magnetoencephalogram, caloric stimulation, vertigo, current-arrow map 


INTRODUCTION 

We have first found an abnormal auditory-magnetic-field (AEF) response in patients with chronic dizziness (CD), which was defined [Oe 2002a] 
as abnormal feelings of floating, swaying and/or non-rotatory dizziness persisting for more than 6 months [Kandori 2002b] [Oe 2002a]. The 
abnormal response produced the rotational current distribution on the temporal lobe. 

However, whether or not the abnormality also appear in the patients with vertigo was not clear. To understand the abnormality in vertiginous 
patients, we have measured the AEF signal during the post-CVS period of vertiginous sensations [Kandori 2003]. 


METHODS 

The AEF signals of the five volunteers (age: 30+10) were measured three times (before the CVS, 2-4 min after the CVS, and 4-6 
min after the CVS; see Fig. 1) from the left hemisphere of each volunteer. We used a bandpass filter (0.1-50 Hz), notch filter (50 Hz), 
and amplifier circuit to measure the AEF signals by using a burst of sound (1 kHz, 90 dB, center stimulus frequency: 0.7 Hz). The 
current distribution at highest-amplitude waveform (NIOOm) was visualized and used to calculate rotational value (« dI rot ). 


RESULTS 

Typical signals and CAMs from a volunteer who experienced a strong vertiginous sensation before and after caloric irrigation are 
shown in Fig. 2. The signals are the AEF and ENG as averaged by using the acoustic stimulation trigger. The CAMs are for NIOOm, 
which is the highest magnitude in the AEF waveforms. The difference between the AEF signals and CAMs before and after CVS is 
slight, in spite of the subjects’ sensation of rotation. 

The quantitative values, dI roh before and after the CVS are shown in Fig. 3. Results for the three volunteers (open circles) with the 
stronger induced vertiginous sensation (stronger nausea) show a tendency for dl wt to increase just after CVS. However, the highest 
value of the open circles is still within the earlier reported range for normal elderly subjects [Oe, 2002]. On the other hand, values 
(solid circles) for the two volunteers who only experienced mild dizziness showed no change during the measurements. 


DISCUSSION 

The new parameter ( dl r0t ) is more sensitively indicates dizziness (non-rotatory sensation) than vertigo. We can thus conclude that 
the main cause of the dizzy feeling in CD is a dysfunction of the balance perception mechanism in the auditory cortex. These findings 
suggest that the auditory cortical region may play an important role in body-balance perception of floating sensations. 
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Fig. 1 Protocol of AEF measurement and CVS stimuli. 



Fig. 3 Protocol of AEF measurement and CVS stimuli. 
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Fig. 2 Results before and after CVS 
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ABSTRACT 

A new approach to understand neural dynamics underlying the generation of auditory evoked magnetic field is proposed. Transfer functions are 
evaluated by using a blind identification method based on feedback system theory to get a dynamical understanding of brain auditory functions. Their 
impulse functions are also examined. 

KEY WORDS: Magnetoencephalogram, auditory evoked magnetic field, blind identification, blind signal separation, feedback system, transfer 
function, impulse response 

INTRODUCTION 

A new approach to understand neural dynamics underlying the generation of auditory evoked magnetic field (AEF) is proposed, in which we use a 
method of blind identification of transfer functions (TFs) based on feedback system theory as mentioned in [Kishida, 1996] [Kishida, 2003a]. In the 
conventional MEG analysis, averaged waveforms have been used to investigate cortical representation of the human cognitive processes, however, 
there remain much more dynamic properties included in MEG time series data still to be analyzed. Especially, much attention has to be paid to 
extract dynamic information of associated activities between regions, which may be included in MEG time series. Here we show a new approach to 
investigate brain dynamics in terms of TFs between cortical regions. 

Before applying our method of blind identification of TFs, MEG time series data were temporally decorrelated by using a blind signal separation 
(BSS) method [Kishida, 2003a] [Kishida, 2003b]. Two BSS components were selected from their stochastic structure of periodic AEF. The remixing 
matrix was applied to the two selected components to retrieve AEF MEG signals. To investigate inter-cortical dynamics of auditory brain functions, 
TFs between two positions as shown in Figure 1 were calculated from the viewpoint of statistical inverse problem. TFs of AEF were calculated to 
infer neural dynamics underlying the auditory brain responses. Their impulse functions were also evaluated. 

METHODS 

1 kHz pure tone bursts with the sound intensity of 80 dB SPL, which were made by Model SMP-4100 (Nihon-Koden Co.) and repeated once every 
2.194 seconds (0.4556Hz), were delivered to the right ear of a healthy subject through a plastic tube of 1 m in length. MEG signals generated by 130 
auditory trials were sampled at 700 Hz. MEG signals related to AEF can be extracted by using the temporal decorrelation method of BSS. In this 
method, two BSS components were selected in reference to periodical peaks of autocorrelation functions and power spectral density functions, since 
the periodic nature of the auditory stimuli used in this study can be utilized to characterize the MEG components evoked by them. That is, MEG time 
series, which were generated by the auditory stimuli presented in the frequency of 0.4556Hz, were temporally decorrelated to form two BSS 
components with peaks at 0.4556Hz and its higher harmonics in power spectrum density, before applying our blind identification of the TFs. 

Since MEG time series data were measured by 61 pairs of planar gradiometers of Neuromag 122™ system; each pair measures the two independent 
tangential derivatives of the magnetic field component normal to the helmet-shaped surface approximating the shape of the scalp, the principal 
component for each sensor pairs was calculated, which approximately represents the 
intensity of the cortical activity just below the sensor location. We selected 
measurement variable yi(n) for the principal component of two dimensional data 
(69ch 70ch) at the position PI in Figure 1, and also y 2 (n) for that of (107ch 108ch) 
at P2. The TFs between these two positions are calculated from the viewpoint of 
statistical inverse problem. Let two variables yi(n) and y 2 (n) be described by a 
following feedback system model with TFs F b 12 (z' 1 ) from P2 to PI and F b 21 (z _1 ) 
from PI to P2; 


where z" 1 is the backward time shift operator and n is discrete time. 

In the case of positions PI and P2, a minimum phase innovation model of two 
variables yi(n) and y 2 (n) was identified from principal components of AEF MEG 
time series data by the following steps: 

1) calculation of correlation functions from two principal components, 

2) singular value decomposition of Hankel matrix of which elements were 
arranged by correlation function matrices , 

3) selection of a stable solution of matrix Riccati equation, 

4) determination of coefficient matrices of innovation model as mentioned in 
Appendix A of [Kishida, 2003a],. 

Then, a closed loop TF matrix was obtained from the minimum phase innovation 
model of two variables yi(n) and y2(n), which can be rewritten into the 
representation of feedback system structure: 




Figure 1. Location of MEG SQUID sensors, and odd 

or even number channel indicates di?. 0 r dB r sensor. 

dx dy 
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y\(.n) = F n (z~ l )y 2 (n) + F x (z~ x )y x {n) 
y 2 (n) = F 2 , (z _1 )y t (n) + F 2 (z~ l )y 2 («) 

where y(n) is the innovation. Finally, TFs F 12 (z' 1 ) and F 2 i(z _1 ) were calculated 
from the closed loop TF matrix by our blind identification of TFs based on 
feedback system theory under the identifiable condition of feedback structure 
[Kishida, 1996]. 

RESULTS 

Averaged MEG signals showed clear N1 peaks at around 100 ms in 
latency (cf. Figure 4, solid lines). The bode diagrams of identified TFs F^z' 1 ) 
and F 2 i(z' 1 ) between the positions PI and P2 were shown in Figure 2 by solid 
and broken lines, respectively. In this figure, bode diagrams, which were 
derived from seven innovation models with different number of singular 
values of Hankel matrix, were overlaid. The estimation of the TFs was robust 
regardless of identified innovation models of different sizes, which indicates 
the reliability of the results obtained by using the proposed method. 

Their impulse response functions of seven identified TFs are shown in 
Figure 3. Seven impulse responses from left hemisphere (PI) to right 
hemisphere (P2) were overwritten, and had constant negative values. Those of 
inverse direction had oscillations in plus and negative values. However, it is 
concluded that all changes of impulse responses decay within about 40msec. 

DISCUSSION 

We proposed a new approach to investigate brain dynamics in terms of the 
TFs between cortical regions, and applied the technique to estimate TFs 
between a specific pair of MEG channels measuring auditory evoked brain 
responses. This method can be applied to any pair of MEG channels as long as 
the minimum phase condition is held. 

In this study, we calculated the TFs between MEG data measured at two 
sensor locations instead of those in the source space, since dynamic property 
in the MEG sensor space when using planer gradiometer system is expected to 
represent the cortical activity just below the sensor location. If two positions 
are close to each other, transformation of the sensor space signal into source 
space activity by using an appropriate inverse procedure will be required to 
dissociate the neural activity in one location from another.. 

Finally, let us compare the AEF MEG averaged waveforms with ones 
predicted by using identified innovation model of order 13 in order to check 
goodness of identified TFs. The solid line in the upper part of Figure 4 is an 
averaged waveform av(y0 of the principal component y x at PI. The broken 
line is a predicted waveform given by F 12 (z' 1 )av(y 2 ). The prediction error is 
defined by 5y 1 :=av(y 1 )-F 12 (z' 1 )av(y 2 ). The normalized error value of 
(5yi/av(y!)) 2 was calculated as 0.70128. In the lower part of Figure 4 the 
normalized error value of5y 2 :=av(y 2 )-F 21 (z' 1 )av(yi) was 0.23992. Hence, it is 
concluded that the identification of F 21 (z _1 ) is better than that of F 12 (z _1 ). 
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ABSTRACT 

Cortical activity underlying speech perception has been studied mostly by using isolated vowels with constant formant frequencies. Speech, 
however, is characterized by formant transitions whereby formant frequencies change as a function of time. We used magnetoencephalography 
(MEG) to investigate cortical activity elicited by isolated vowels and diphthongs containing formant transitions. Ten subjects were presented with 
two isolated vowels /a/ and /u/ and diphthongs /au/ and /ua/. Stimulus duration was 200 ms, and the diphthongs started and ended with a 50-ms 
constant-formant period and included a 100-ms linear transition period. Apart from studying the auditory N 100m response, we examined subsequent 
brain activity in a 500-ms poststimulus time window, as the transitions were expected to elicit activity also in later stages of cognitive processing. All 
the stimuli elicited prominent N100m responses. Thereafter, the brain activity elicited by isolated vowels and diphthongs started to diverge at around 
200 ms. The present observations indicate that stimulus feature analysis (e.g., vowel identity) is reflected in the NIOOm whereas formant transitions 
are reflected in later brain activity. Notably, the stimuli appeared to elicit left-hemispheric activity resembling the N400, typically obtained by using 
more complicated speech stimuli such as words and sentences. 

KEY WORDS 

Human, Auditory, Cortex, Speech Perception, Vowel, Formant Transition, Diphthong, Magnetoencephalography, MEG, NIOOm, N400m 

INTRODUCTION 

Complex speech sounds such as syllables and words consist of different phonemes linked to one another. The physiological production of these 
high-level speech units corresponds to creating a continuum of articulatory movements, which gradually change the shape of the vocal tract. The 
main acoustical consequences of these changes in the human articulatory mechanism are the formant transitions whereby formant frequencies change 
as a function of time [Kent & Read, 1992]. With regard to speech perception, these formant transitions have been assumed to enhance word 
recognition by helping the listener predict the following phoneme. Consequently, this predictive ability rapidly reduces the cohort of word 
representations in the mental lexicon activated by the initial phoneme of the word [Marslen-Wilson & Welsh, 1978]. Unfortunately, the brain 
dynamics underlying these coarticulation processes are largely unknown because contemporary brain research has mainly focused on early cortical 
activity, most notably the auditory NIOOm response elicited by isolated phonemes. 

Here we report the preliminary results of a study where we explored the cortical activity elicited by isolated vowels and combinations of two 
vowels (i.e., diphthongs) containing formant transitions. We presented the subjects with the Finnish vowels /a/ (as in ‘hut’) and /u/ (as in ‘put’) as 
well as their combinations /au/ and /ua/. Since the isolated vowel and the corresponding diphthong beginning with the same vowel were acoustically 
identical during the first 50 ms, after which a 100-ms formant transition period in the diphthong took place, we expected the formant transitions to be 
manifested in the cortical activity subsequent to the NIOOm response. 

METHODS 

Ten subjects (mean age 25 years, 5 females) participated with informed consent in the study approved by the Ethical Committee of Helsinki 
University Central Hospital. The subjects reported being right-handed and having normal hearing. During measurements, the subjects, instructed not 
to pay attention to auditory stimulation, were concentrating on reading a self-selected book or watching a silent video. We used the Semi-synthetic 
Speech Generation method [Alku et al, 1999] to construct the stimuli. Firstly, the waveforms of the isolated vowels /a/ and /u/ were created by 
exciting an artificial vocal tract model with a real glottal pulseform extracted from a natural utterance. Hence, the vowels produced were acoustically 
different only in terms of their formant frequencies which were adjusted for the lowest four resonances of the vocal tract (F1-F4) as follows: /a/: FI = 
670 Hz, F2 = 1000 Hz, F3 = 1950 Hz, and F4 = 3440 Hz, /u/: FI = 330 Hz, F2 = 580 Hz, F3 = 1900 Hz, and F4 = 2900 Hz. The diphthongs were 
produced by using the same glottal excitation waveform but by continuously changing the formant frequencies of the artificial vocal tract model. In 
the beginning and at the end of the diphthongs, there was a 50-ms constant-formant period and between these two periods, there was a formant 
transition period during which the formants changed linearly. All the sounds were equalized in terms of their intensity by normalizing the energies of 
the sounds as computed by the squared sums of the digital time domain signals. 

For each subject, the sound pressure level was adjusted by using the vowel /a/ as a reference stimulus, thus producing an intensity range of 70-75 
dB SPL. The 200-ms stimuli were binaurally delivered to the subject through plastic tubes and ear pieces at an inter-stimulus interval of 800 ms and 
with more than 150 repetitions per stimulus type. The stimuli were divided into separate sequences presented in a pseudorandom order and their 
presentation order was counterbalanced across subjects. Cortical activation elicited by the stimuli was registered by using a 306-channel MEG 
measurement device (Vectorview, Elekta Neuromag Oy, Finland) in a magnetically shielded room. The evoked responses were averaged over a 
period of 600 ms including a 100-ms pre-stimulus baseline and filtered with a passband of 1-30 Hz. Epochs exceeding an absolute amplitude 
variation of 3000 fT/cm were excluded online, and electrodes monitoring horizontal and vertical eye movements were used in removing artefacts 
defined as the absolute value exceeding 150 pV. 

The auditory NIOOm as well as subsequent brain activity were studied for possible effects in response amplitude, latency, and source location. For 
each hemisphere and subject, the waveform analysis was carried out for the field gradient vector sums from the six pairs of planar gradiometers 
exhibiting maximum NIOOm responses. Equivalent current dipoles (ECD) were fitted to a time point defined as the moment of the NIOOm reaching 
its peak amplitude in the waveform calculated over the 66 sensors located over either the left or right temporal brain areas. When possible, ECDs 
were also fitted to later time points in the elicited waveform. ECDs could be realiably modelled at latencies of about 230, 340 and 420 ms. Statistical 
analyses of the data were performed by using repeated measures ANOVA and Newman-Keuls post hoc -tests. 
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RESULTS 

The analysis of the response waveforms revealed that all the stimuli elicited prominent NIOOm responses, with the diphthongs eliciting slightly 
shorter-latency NIOOm responses than the isolated vowels (121 vs 127 ms; F[l,22] = 18.7, P < 0.001). The right-hemispheric responses were larger 
than the left-hemispheric ones (41 vs 26 fT/cm; F[l,25] = 27.45, P < 0.001). As shown in Figure 1, the response waveforms elicited by the isolated 
vowels and diphthongs started to diverge at around 160 ms both in the left and the right hemisphere. Notably, in the left hemisphere, following the 
NIOOm, the diphthongs elicited an additional deflection at around 200 ms which commenced some 20 ms earlier than that elicited by the isolated 
vowels. For both the diphthongs and isolated vowels, this activity continued up to 500 ms, and was larger in amplitude for the diphthongs than for the 
vowels. In the right hemisphere, the isolated vowels elicited a response peaking at around 220 ms. Diphthongs elicited a similar, but larger-amplitude 
response peaking later, at 250 ms. The right-hemispheric response returned to baseline at 360 ms. In most subjects, these responses were amenable 
for ECD modeling. 

Figure 2 shows the ECD locations of the responses in the anterior- 
posterior and medial-lateral dimensions. Interestingly, for each stimulus type, 
cortical activity shifted in the anterior direction over time. The temporal 
evolution of this shift was different in the two hemispheres: In the right 
hemisphere, the NIOOm response and the subsequent response at around 
200-300 ms were located in the same cortical area, with only the activity at 
around 400 ms having shifted to anterior areas. However, in the left 
hemisphere, this anterior shift was observable already at 250-350 ms, and the 
ECD locations at 400 ms were approximately 10 mm anterior to the source 
of the auditory NIOOm. 

DISCUSSION 

Here we studied the cortical activity underlying the perception of isolated vowels and 
diphthongs containing formant transitions. In addition to the auditory N100m, all stimuli elicited 
additional responses in the 200-400 ms latency range. These responses were more prominent for 
the diphthongs than for the isolated vowels. By 400 ms, the brain areas activated by the isolated 
vowels and diphthongs had shifted anteriorly by approximately 10 mm, possibly indicating that 
the later activity, rather than being a sustained response originating from a single cortical area 
[Seifritz et al, 2002], is a manifestation of a process allocating activity to separate cortical areas. 

The time course of this anterior shift being faster in the left than in the right hemisphere is 
consistent with the idea of different right- and left-hemispheric temporal integration windows 
[Hickok & Poeppel, 2000]. The shorter integration period in the left hemisphere could be related 
to the left-hemispheric specialization in analyzing phonemic content, such as formant transitions 
in a speech signal. 

Further, the late responses reported here, elicited by 200-ms vowel stimuli in passive 
recording conditions, resemble the auditory N200/N400 responses obtained in experiments on 
word and sentence processing in attended conditions, where the N200 has been associated with 
phonological processing and the N400 with various stages of processing of word meaning [e.g., 

Connolly et al, 1992]. Our recent findings dealing with the prerequisites of localizing the N400m 
[Makela et al, 2001] show that the N400 elicited by long-duration words might in fact reflect the 
processing of acoustic properties of the target words. Establishing the relationship between the 
late responses observed here and in the auditory N200/N400-paradigms might be helpful for 
clarifying the neural basis of word recognition. 
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Figure 1. Grand-averaged left- and right-hemispheric 
responses to isolated vowel /u/ and diphthong /ua/. 
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ABSTRACT 

We studied the cortical activation underlying perception of glides in speech fundamental frequency (FO) as indexed by the amplitude, latency and 
source location of the auditory NIOOm response registered with magnetoencephalography (MEG). Ten subjects were presented with Finnish vowels 
with either a constant or an ascending/descending FO. We found that the human auditory cortex is sensitive to these time-varying changes in the FO of 
speech: vowels with a constant FO elicited more prominent NIOOm responses than did vowels with ascending or descending FO glides. These results 
suggest that the speech-related behavior of the NIOOm arises out of cortical sensitivity to variations in the FO and its harmonics which underlie the 
perception of pitch and intonation. The present observations are interpreted in terms of the interrelatedness of speech production and perception. 
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INTRODUCTION 

Speech fundamental frequency (FO) and its integer multiples (2 • FO, 3 • FO, etc.) are together perceived as voice pitch and their time-varying 
changes result in the perception of intonation in speech. Intonation is used in conveying the speaker’s intentions, emotions, and state of mind to the 
listener who uses this information in the interpretation of the speech message. Ascending and descending FO glides are produced in the human voice 
production mechanism by accelerating or decelerating the vibration rate of the vocal folds [Fant, 1970]. Consequently, the glottal pulseform (the 
periodic excitation waveform of voiced speech produced by the vocal folds) is modified. 

While both clinical evidence and hemodynamic brain measures have suggested that the right hemisphere is specialized in the processing of 
harmonic features of speech [Poeppel, 2003], our understanding of early (< 200 ms) cortical processing of speech F0 information has been limited to 
observations on activity elicited by sounds with a constant F0. Recently, we showed that speech sounds comprising a periodic structure originating 
from the natural glottal excitation waveform elicit larger N100m responses than the same speech sounds with an aperiodic structure produced by 
noise excitation [Alku et al, 2001] and that the cortical activity elicited by constant-FO speech sounds and tones of corresponding frequencies have 
separate time courses [Makela et al, 2002]. With these observations supporting the interrelatedness of speech production and perception [Liberman & 
Mattingly, 1989], we suggested that the ability to categorize sounds as speech or non-speech probably arises out of cortical sensitivity to the acoustic 
structure of natural speech, determined by the structure and functioning of the human voice production system. 

Here we elaborate on the theme by investigating cortical activity elicited by the F0 contours embedded in intonation of natural speech. To this 
end, we compared the processing of constant-FO speech sounds with the processing of ascending and descending speech F0 glides as indexed by the 
amplitude, latency and source location of the auditory N100m response. 

METHODS 

Ten right-handed adults (age 23-38 years, 3 females) participated with informed consent in the study approved by the Ethical Committee of the 
Helsinki University Central Hospital. During the measurement, the subjects, instructed not to pay attention to the auditory stimulation, were reading a 
self-selected book. The stimuli were produced with the Semi-synthetic Speech Generation method [Alku et al, 1999]. Firstly, a waveform of the 
vowel /a/ was created from a natural utterance by estimating the glottal excitation pulseform produced by the vibrating vocal folds and then using this 
signal as an input to an artificial vocal tract modelled by a digital all-pole filter. The F0 was 114 Hz and the lowest four formant frequencies of the 
vowel were set at 670 Hz (FI), 1000 Hz (F2), 1950 Hz (F3), and 3440 Hz (F4). Secondly, by manipulating the lengths of consecutive glottal cycles in 
the excitation waveform, the same vowel was produced with a constant F0 of 206 Hz and two linear F0 glides ascending from 114 Hz to 206 Hz and 
descending from 206 Hz to 114 Hz. The loudness of the 200-ms stimuli was equalized with a computational loudness model [Moore et al, 1996]. For 
each subject, the sound pressure level was adjusted at a comfortable hearing level (intensity range 70-75 dB SPL) by using the vowel with the F0 of 
206 Hz as a reference stimulus. The stimuli were binaurally delivered to the subject through plastic tubes and ear pieces at an interstimulus interval of 
800 ms and with >150 repetitions per stimulus. The stimuli were presented in pseudorandom order, counterbalanced across subjects. 

Cortical activation elicited by the stimuli was registered by using a 306-channel MEG device (Vectorview, Elekta Neuromag Oy, Finland) in a 
magnetically shielded room. The evoked responses were averaged over a period of 600 ms including a 100-ms pre-stimulus baseline and filtered with 
a passband of 1-30 Hz. Epochs exceeding an absolute amplitude variation of 3000 fT/cm were excluded online, and electrodes monitoring horizontal 
and vertical eye movements were used in removing artefacts (>150 pV). The NIOOm was studied for effects in response amplitude, latency, and 
source location. The waveform analysis was carried out for the vector sum of the pair of planar gradiometers showing the maximum NIOOm, 
separately in each hemisphere and for each subject. For equivalent current dipole (ECD) estimation, a subset of 66 sensors located over either the left 
or right temporal brain areas was used, with the ECDs fitted to a time point defined as the moment of the NIOOm reaching its peak amplitude in the 
waveform calculated over these sensors. Statistical analyses were performed by using repeated measures ANOVA and Newman-Keuls post hoc tests. 

RESULTS 

The analysis of the NIOOm response waveforms revealed that the constant- and gliding-FO vowels had distinctly different effects on the temporal 
dynamics of the right- and left-hemispheric cortical activity (F[3,24] = 3.26, P < 0.05). Interestingly, the right-hemispheric NIOOm responses to 
vowels with F0 glides had a significantly shorter latency than the left-hemispheric ones (descending FO-glide: 110 & 121 ms for the right and left 
hemisphere, respectively (P < 0.01); ascending FO-glide: 112 & 119 ms for the right and left hemisphere, respectively (P < 0.05)). However, the two 
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hemispheres responded to the constant-FO stimuli at the same latency (113 & 116 ms for 206 & 114 Hz F0 in the right hemisphere; 117 & 118 ms for 
206 & 114 Hz F0 in the left hemisphere; P — n.s. in all comparisons). 

Furthermore, the analysis of the amplitude of the NIOOm response waveforms for constant- and 
gliding-FO vowels, depicted in Figure 2, revealed that there were distinctly different effects on the 
NIOOm amplitude (F[4,36] = 5.26, P < 0.01). Importantly, the vowels with F0 glides elicited 
significantly smaller-amplitude NIOOm responses (54 & 55 fT/cm for the descending and ascending 
glides, respectively) than the vowels with a constant F0 (67 & 71 fT/cm for 206 and 114 Hz, 
respectively; P < 0.05 for 206 Hz vs. the descending and ascending F0 glides; P < 0.01 for 114 Hz vs. 
the descending and ascending glides). No effects of stimulus frequency on the amplitude of the NIOOm 
were observed in the right and left hemisphere (F[4,36] = 0.96, P = n.s.), although the right-hemispheric 
responses were larger than the left-hemispheric ones (70 & 52 fT/cm; F[ 1,9] = 6.62, P < 0.05). 

The analysis of the ECD locations of the NIOOm revealed that both the constant- and gliding-FO 
vowels activated the same brain area in the auditory cortex of each hemisphere, with the right- 
hemispheric ECD locations being more anterior than the left-hemispheric ones (8.5 and 5.5 mm, 
respectively; F[l,8] = 4.54, P < 0.05). The average right- and left-hemispheric ECD locations of the 
NIOOm were 48/-49 mm in the medio-lateral direction (F[l,8] = 0.02, P = n.s), and 54/55 mm in the 
inferior-superior direction (F[l,8] = 0.60, P = n.s.). 

DISCUSSION 

Here we investigated the cortical activation underlying the perception of speech intonation. By using 
vowels with constant and gliding FOs within the FO-range of average male and female speech, we found 
that the human auditory cortex is susceptible to subtle time-varying changes in speech F0: While the 
speech sounds with a constant F0 elicited prominent NIOOm responses, the amplitudes of the NIOOm 
responses for ascending and descending F0 glides were considerably attenuated. Furthermore, 
corroborating the idea of the right-hemispheric specialization in the processing of harmonic information 
in speech [Poeppel, 2003], the right hemisphere responded to ascending and descending FO-glides at a 
shorter latency than the left hemisphere. 

Thus, both the amplitude and the latency of the NIOOm appear to reflect the harmonic spectral 
contents of stimulation. The present results, in concordance with our previous observations [Alku et al, 

2001] [Makela et al, 2002], therefore indicate that speech is processed in a specialized way in the human 
auditory cortex. This speech-specific processing is likely to arise from cortical sensitivity to the 
harmonic spectral structure of voiced speech determined by the periodicity of the glottal excitation, 
which in turn defines pitch and intonation. This conclusion is supported by the observation that the 
ascending and descending F0 glide of the vowel was reflected in the amplitude of the NIOOm. These 
amplitude modulations were probably due to acoustical cues generated by a change in the periodicity of 
the glottal excitation which affected not only the F0 but also changed the frequency and amplitude 
compositions of the harmonics [Fant, 1970]. 

The processes specifically related to the perception of the direction (e.g. rise/fall) of F0 glides in speech sounds, however, require further 
clarification. The near-identical NIOOm amplitude attenuation in response to speech sounds with the ascending and descending FO-glides observed 
here may be explained by the rate of the change of the F0 being linear in both cases. Further studies are needed to establish whether the range (Hz) or 
the rate (Hz/ms) of the F0 glide in speech is reflected in the amplitude of the NIOOm response. 
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Figure 1. The right-hemispheric 
ECDs of the NIOOm fitted to the 
grand-averaged waveforms for the 
vowel /a/ with constant (left) and 
gliding F0 (right). 
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Figure 2. The effects of constant 
F0 (left) and F0 glides (right) on 
the amplitude the grand-averaged 
NIOOm (error bars indicate the 
standard error of the mean). 


518 

















P4-1 


Auditory Evoked Responses Are Additive to Brain Oscillations 

Makinen, V.T. 1 ’ 2 , Tiitinen, H. 1 ’ 2 , and May, P.J.C. 1 ’ 2 

Apperception & Cortical Dynamics (ACD), Department of Psychology, University of Helsinki, Finland; BioMag Laboratory, 

Engineering Centre, Helsinki University Central Hospital, Finland 


ABSTRACT 

The generation mechanism of stimulus-evoked electro- and magnetoencephalographic (EEG & MEG) responses has remained controversial. One 
view holds that evoked responses are independent components, additive to ongoing brain activity. The other view holds that evoked responses are 
generated via stimulus-induced phase reorganization of ongoing brain activity. This issue has been commonly addressed with signal processing 
techniques that assume a high level of stationarity (i.e., unchanging properties over time) of the measured signal. Here we used signal analysis 
methods suitable for analyzing non-stationary signals. We found that auditory stimulation leads to a large power increase of the poststimulus signal 
compared to prestimulus level. Linear superposition of the (time-domain) averaged response and the unaveraged prestimulus signal accounted for 
90% of the power increase. Further, we found that auditory stimulation does not lead to a phase-coherent state of ongoing oscillations. Taken together 
our results show that auditory evoked responses are directly additive to ongoing oscillations and only 10 % of the observed power increases are 
explained by non-phase-locked brain activity. When examining evoked brain activity with methods providing simultaneous frequency and time 
information, emphasizing temporal accuracy is likely to provide more accurate descriptions of non-stationary processes of the human brain. 
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INTRODUCTION 

The neural underpinnings of human sensory and cognitive processes can be non-invasively explored with MEG and EEG through stimulus- 
evoked brain responses. The human brain, however, displays ongoing oscillatory signals in MEG and EEG even without sensory stimulation. One 
possibility is that evoked responses are the result of stimulus-induced phase synchronization of ongoing oscillations [Sayers et al, 1974] [Makeig et 
al, 2002] [Jansen et al, 2003] [Klimesch et al, 2004]. The alternative possibility is that evoked responses are separate from and additive to ongoing 
oscillations. The examination of this issue is intimately tied to the methodology used for the analysis. Most established signal analysis methods are 
based on the theory of statistical signal processing [e.g. Hayes, 1996], where signals are commonly assumed to be stationary (i.e., their statistical 
properties do not change over time). Recent methods such as wavelet and short-time Fourier transforms (FTs) are well suited for the analysis of 
quasistationary signals such as ongoing brain oscillations which are approximately stationary over some short time window. However, when these 
methods are applied to transient brain responses - which (by definition) are neither stationary nor quasistationary - misinterpretations may result. 

FT-based spectral analysis tools are widely available and they have been used for examining whether EEG or MEG signal power is affected by 
sensory stimulation. However, the results of the analysis are affected by how the data is preprocessed (e.g., in terms of the length of the analyzed time 
window, the weighting window, etc.). Further, although FT-based spectral estimation provides accurate analysis of stationary signals only, negative 
results have been used as evidence of the absence of additive components to ongoing brain oscillations [Sayers et al, 1974] [Makeig et al, 2002]. It is, 
however, possible to measure the power of a signal with minimal a priori assumptions and without spectral estimation; the power of a signal is 
defined as its integrated squared magnitude and filtering can be used to focus on the frequency band of interest. Here we evaluated the effect of 
auditory stimulation on MEG power with this method and examined the averaged auditory responses as an explanatory factor of the power changes. 

The question of the generation of evoked responses has also been approached through measuring intertrial coherence (ITC), i.e., the phase 
distribution of MEG or EEG signals over trials [Makeig et al, 2002]. This analysis method focuses on narrow frequency bands and specific brain 
rhythms. The principle of time-frequency uncertainty [Gabor, 1946] [Grochenig, 2001], however, dictates that with narrow frequency bands events 
accurately localized in time become dispersed and affect the power and phase of the signal far from their actual latency of occurrence, thus distorting 
the results. We have previously introduced a method for evaluating ITC without requiring strict limiting of the frequency band [Makinen et al, 
Submitted] and the issue is further elaborated on here. 

METHODS 

Ten healthy human subjects were studied with their informed and written consent. The study was approved by the Ethical Committee of Helsinki 
University Central Hospital. The measurements were carried out in a magnetically shielded room with a 306-sensor MEG device (Vectorview, Elekta 
Neuromag Oy, Finland). The subjects watched a silent film and were under instruction to ignore the auditory stimuli. The stimuli were 750-Hz tones 
of 50-ms duration with 5-ms linear onset and offset ramps. The stimuli were adjusted to 80 dB (sound pressure level, A-weighted) and presented 800 
times using an onset-to-onset interstimulus interval of 1200 ms. The data were collected using a sampling rate of 600 Hz and a pass-band of 0.03-200 
Hz. The data were divided into 1200-ms epochs (starting 400 ms before stimulus onset) and epochs coinciding with >100 pV EOG amplitudes were 
excluded from analysis. Epochs were also rejected if their mean variance over all sensors (1-200 Hz band) exceeded two standard deviations (SD; 
calculated over all trials). 

For each subject, data was analyzed using the sensor which displayed the NIOOm response of largest amplitude. The data was filtered with 
Chebyshev type-II filters optimized for maximal steepness of transition from pass-band to stop-band. Stop-band attenuation was set to 40 dB and 
two-way zero-phase filtering was employed. The signal was band-limited to 4-40 Hz for the examination of ITC, which was evaluated as the SD of 
signal amplitudes over trials calculated at each time point of the epoch. The mean of the -400...0 ms prestimulus values was used as the baseline SD 
and the mean of values over 75... 125 ms was used as the SD of the NIOOm. Signal power was calculated from the unaveraged data (4^10 Hz pass- 
band) as the mean of the squared signal in 250-ms pre- and poststimulus time windows. The same frequency band and time windows were used with 
the superposition of averaged responses to the unaveraged prestimulus data. Statistical significance was evaluated with Mests. 
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RESULTS 


Auditory stimulation elicited a prominent NIOOm response peaking around 100 ms from stimulus 
onset (Fig. la; unfiltered data). The response was sharp and well localized in time thus resembling an 
impulse response, which has energy at all frequencies. When the frequency band of the response is 
limited the response inevitably spreads in time (Figs. lb-d). Narrow-band filters and wavelets are often 
used in the analysis of signal phase and ITC. Yet, as demonstrated in Figs, lb-d, although the unfiltered 
signal is a transient deflection it determines the phase of the oscillatory response manufactured by the 
filtering. Therefore, to analyze the ITC properly one must use a sufficiently wide frequency band in 
which case the ITC evaluation can be performed through analysis of the intertrial amplitude variance 
over time. To elaborate, a stimulus-induced phase coherent state of oscillation is analogous to an 
amplitude coherent state and can be seen as a reduction of the trial-to-trial amplitude variance compared 
to baseline. However, we observed an increase of 5 % (^[9] = 2.9, P < 0.01) in the amplitude variance at 
the latency of NIOOm (Fig. le). Thus, the ongoing oscillations are not in a coherent state at this latency. 

Auditory stimulation increased the poststimulus signal power of the unaveraged data in the 4-40 Hz 
frequency band by 27 % (*[9] = 3.4, P < 0.01) and a direct linear superposition of the averaged response 
and the unaveraged prestimulus data, separately carried out for each subject, accounted for 90% of the 
power increases. 

DISCUSSION 

Here we observed that no phase synchronization of ongoing oscillations accompanies the emergence of 
auditory evoked responses in MEG, corroborating our previous results obtained with spatially filtered 
data [Makinen et al, Submitted]. The slightly increased incoherence of amplitude is explained by the 
trial-to-trial amplitude variance of the NIOOm response which is additive to that of the ongoing 
oscillations. Analysis of signal power, based directly on the definition of signal energy, revealed 
prominent power increases produced by auditory stimulation. Only 10 % of these increases were 
explained by changes in non-phase-locked brain responses. Thus, 90 % of the auditory evoked processes 
are accounted for by the waveform observed with stimulus time-locked averaging being additive to 
ongoing oscillations. 

Methods intended for analyzing stationary periodic data (e.g. narrow-band filters) will unavoidably 
provide inaccurate results on non-stationary signals such as evoked responses. More generally, signals 
generated by the brain have both time and frequency aspects and, depending on the emphasis of the 
analysis, very different descriptions can be obtained from the same data. For example, wavelets with a 
high temporal resolution yield information on the properties of each evoked response, whereas wavelets 
with high frequency resolution reflect the temporal distribution of different evoked responses [Makinen 
et al, In press]. The question whether a signal yields more meaningful information in the frequency (i.e. 
rate) or time domain is important for analysis of EEG and MEG data but might also hold the key for 
understanding the neural code on the single-cell level. 
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ABSTRACT 

Brain processes phase-locked to stimuli can be readily observed with electro- and magnetoencephalography (EEG & MEG, respectively) using 
stimulus-triggered averaging of the measured signal. The detection of non-phase-locked brain processes depends on the method used for analyzing 
the unaveraged data. Here we introduce a technique, partition-referenced moment (PRM) power spectrum, which uses established spectral estimation 
algorithms but yields a power spectrum with sharp, easily distinguishable peaks in an otherwise level spectrum even when the signal (such as EEG & 
MEG) is of the one-over-frequency-slope type. Employing this method and wavelet transforms, we show that transient auditory brain responses are 
followed by dispersed small-magnitude power reductions. Power reductions in the 15-30 Hz frequency range occurring around 200-400 ms after 
stimulus onset do not reflect modulation of a specific ongoing oscillation. In contrast, power reductions occurring around 400-600 ms were specific 
to ongoing 10 Hz-oscillations. Thus, the presented methods enable the straightforward detection of ongoing brain oscillations and their association 
with event-related power changes. 

KEY WORDS 

1/f, Auditory, Electroencephalography, Magnetoencephalography, Desynchronization, Ongoing oscillations, Spectral estimation, Wavelets 

INTRODUCTION 

Spectral estimation is the basic statistical signal processing method and an obvious tool for examining ongoing oscillations in the EEG or the MEG. 
Brain processes, however, do not exhibit a high degree of stationarity (i.e., their statistical properties change over time) and therefore spectral 
estimation needs to be performed using relatively short time windows. From this, it follows that spectral peaks are often broad and ill-defined. 

Further, the spectrum obtained from an EEG or a MEG signal has a one-over-frequency (1/f) shape, characteristic for many natural phenomena [e.g. 
Mandelbrot, 1999]. At low frequencies, there is more power per frequency unit than at high frequencies and, because of these large-magnitude 
differences, power spectra are typically viewed on a logarithmic scale. This usually leads to further complications in identifying peaks corresponding 
to brain oscillations. With the spectral peaks not being accurately aligned (e.g. over subjects), averaging additionally spreads the spectral peaks along 
the 1/f slope. The peaks in the power spectrum could be enhanced by calculating moments but this would increase even further the large magnitude 
differences inherent in the 1/f spectrum. Here we introduce the partitioned-referenced moment (PRM) method, which eliminates these problems. To 
demonstrate the effectiveness of this method, the spectrum obtained from MEG data is used to associate ongoing oscillations to power reductions 
elicited by auditory stimulation. 

Event-related power reductions (event-related desynchronization, ERD) are actively studied in the operation 
of the human sensorimotor system. With auditory stimulation, ERDs have been observed under special 
conditions [Sutoh et al, 2000] [Kaiser et al, 2003] but have been interpreted to reflect motor and/or cognitive 
operations rather than being elicited by auditory stimulation alone [Krause, 1999]. We have recently found 
that even sine tone stimulation in passive recording conditions elicits power reductions in auditory and 
parietal brain areas [Makinen et al, In press]. Utilizing auditory stimulation, MEG recordings, and a 
combination of the novel PRM method and wavelets, we examined power reductions occurring in the brain. 

METHODS 

Data collection, artefact rejection and wavelet analysis: Ten healthy human subjects were studied with 
their informed and written consent. The study was approved by the Ethical Committee of Helsinki University 
Central Hospital. The measurements were carried out in a magnetically shielded room with a 306-sensor 
MEG device (Vectorview, Elekta Neuromag Oy, Finland). The subjects watched a silent film and were under 
instruction to ignore the auditory stimuli. The stimuli were 750-Hz tones of 50-ms duration (with 5-ms linear 
onset and offset ramps) adjusted to 80 dB (sound pressure level, A-weighted) and presented 800 times using 
an onset-to-onset inter-stimulus interval of 1200 ms. The data were collected using a sampling rate of 600 Hz 
and a pass-band of 0.03-200 Hz. The data were divided into 1200-ms epochs (starting 400 ms before 
stimulus onset) and epochs coinciding with >100 pV EOG amplitudes were excluded from analysis. Epochs 
were also rejected if their mean variance over all sensors (1-200 Hz band) exceeded two standard deviations 
(SD), with the SD calculated from all trials. For each subject, data was analyzed using the vector sum from 
the gradiometer pair which displayed the largest-amplitude NIOOm response. Second- and tenth-derivative- 
of-Gaussian wavelets (DOG2 & DOG 10, respectively), set at 1 Hz intervals, were employed (algorithm 
modified from [Torrence and Compo, 1998]). The baseline was calculated from -350 ....-150 ms before 
stimulus onset at each frequency to avoid generating artificial ERDs [Makinen et al, In press]. Probability 
maps were calculated with respect to SDs of the baseline. The lowest significance value shown in Fig. 1 
corresponds to P < 0.05 divided with the number of independent time-frequency tiles [Grochenig, 2001] of 

that area. 

Partition-referenced moment (PRM) power spectra: With the event-related spectral changes limited to the 
first half (0-600 ms) of each trial (Fig. 1) the 600-1200 ms portion of the data was used in the spectral 
analysis performed using a periodogram (2048-point FFT, zero-padding, and data-window detrending). If the 
oscillatory signal is fairly stationary within the analyzed time window, the spectral peak of the oscillation 
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Figure 1 . Auditory event-related 
power decreases obtained with 
a) second and b) tenth derivative 
of Gaussian wavelets. 


521 













P4-1 


becomes sharper as the length of the time-window is increased [e.g. Hayes, 1996]. By partitioning the 
original data window into two or more parts and taking the mean of spectra calculated from these parts, 
one obtains a reference spectrum. Stochastically, the difference between the reference and full-length 
spectrum is, firstly, that the full-length spectrum contains sharper and higher amplitude peaks of the 
underlying stationary oscillations and, secondly, that low frequencies with wavelengths corresponding 
to the full-length window are not present in the reference spectrum. After utilizing appropriately-sized 
subaverages (here, all trials obtained at a single MEG sensor) the mean of the logarithm of the spectra 
of the partitions is subtracted from the logarithm of the spectrum of the full-length time-window. The 
resulting difference spectrum is level (mean = 0) but, importantly, the peaks of the oscillations remain 
and are surrounded by negative deflections which are due to the broad peaks of the spectra of the 
partitions. Because of these negative values, peaks not accurately aligned effectively cancel each other 
out when averaging over the subaverages. Therefore, the difference spectrum is biased towards mean = 

1, and moments can be calculated (16 th moment was used here). The biasing and moments have a 
nonlinear effect in increasing the >1 values more than reducing the <1 values, thus preventing the 
cancellation effectf. 

RESULTS 

The PRM-periodogram produced a level spectrum interspersed with peaks which are clearer and 
sharper compared to those in the 1/f spectrum obtained with a traditional periodogram (Fig. 2; the power peak at 50-Hz is due to the mains power). In 
the PRM spectrum, prominent peaks are observed at 10 and 30 Hz, with the latter probably being a harmonic of the former. The peak around 5.5 Hz 
is probably theta oscillation and, again, the peak at around 17 Hz is likely to be a harmonic of this component. In the wavelet analysis, power 
reductions were observed in two time-frequency regions (Fig 1). The magnitude of the 9-12 Hz power reduction in the 400-600 ms range (DOG2; 
3.5 %, DOG10; 4.0 %) was of the same order as that obtained at 16-23 Hz in the 250-350 ms range (DOG2; 2.3 %, DOG10; 3.0 %). The power 
decreases appear as ripples but this may be attributed at least partially to the multiple maxima of the used real-valued wavelets. 

DISCUSSION 

We introduced a technique which yields an easily analyzable spectrum for a 1/f process. This PRM spectrum has sharp and well-defined spectral 
peaks but is otherwise level. Unlike other methods for removing the 1/f slope, the current method does not require parameters to be searched and 
avoids many over- and under-fitting problems. Moreover, this method is not sensitive to peaks of the data per se but, rather, is a method for detecting 
approximately stationary oscillations buried in noise. A more detailed analysis of the method, however, needs to be performed in the future regarding 
its sensitivity to the stationarity of the underlying signals and the number of partitions and subaverages. This method may be of general use in the 
analysis of noise-buried oscillations and could be recommended for examining ongoing EEG and MEG oscillations. 

Here, the method proved useful for associating ERDs to ongoing oscillations. We observed that auditory stimulation elicits power reductions in 
two regions of the time-frequency plane. Using the PRM method, the reduction in the 10 Hz range was associated with the desynchronization of a 
specific and prominent ongoing oscillation. However, power reductions in the 16-23 Hz range, similar to those found in [Makinen et al, In press], 
were not associated with any peaks in the PRM spectrum, and therefore are probably due to changes in less organized fast fluctuations of brain 
activity. Taken together, our results indicate that power reductions are an integral part of auditory event-related processes. 
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Here psd is the spectral estimation algorithm which includes data preprocessing. W is the length of the single trial analyzed data-window and P is the 
number of its partitions. S is the number of trials in the subaverage. V ones is a vector of ones of the length of the psd estimate. Mis the moment (> 2 
recommended). N is the number of subaverages. This formula can be adjusted according to the properties of the analyzed data. 
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The MMN Is a Derivative of the Auditory N100 Response 

May, P. J. C. and Tiitinen, H. 

Apperception & Cortical Dynamics (ACD), Department of Psychology, University of Helsinki, Finland 


ABSTRACT 

We argue that the mismatch negativity (MMN), elicited by rare auditory events, is generated by the same neural mechanisms as the N100, elicited 
by any audible stimulus. To date, the MMN has been considered to be a unique index of auditory sensory memory and a component which is 
functionally and spatially separate from the N100 response because: (i) MMN and N100 appear to have different generator locations; (ii) the MMN 
occurs too late to be an N100; (iii) the MMN, as opposed to the N100, is elicited by stimulus omissions. By utilizing neural modeling and EEG/MEG 
results, we show that the above reasoning relies on unwarranted assumptions and propose that the MMN is, essentially, an amplitude- and latency- 
modulated N100 response. This study offers a physiologically constrained and theoretically plausible framework whereby brain dynamics in terms of 
stimulus feature maps and their reorganization may be used to describe various memory- and learning-related effects of human auditory cognition. 

KEY WORDS 

Human, Auditory, Cortex, MMN, N100, Electroencephalography, EEG, Magnetoencephalography, MEG, ERP, ERF, Neural Modeling. 

INTRODUCTION 

The MMN is a component of the event-related field and potential (ERF & ERP, respectively) which has been suggested to reflect multifarious 
aspects of cognition, including auditory sensory memory, automatic analysis of the auditory environment, sensory intelligence, processing of speech, 
learning, and even long-term memory [e.g., Naatanen & Alho, 1995]. In addition, it has been suggested to serve as an objective indicator for a 
plethora of clinical conditions, such as hearing disorders, coma, Alzheimer’s disease, schizophrenia [Naatanen et al, 2003], to name but a few. The 
MMN is elicited when repetitive ‘standard’ auditory stimuli are occasionally interspersed with ‘deviant’ stimuli audibly differing from the standards. 
The standards elicit a series of deflections in the ERF/ERP the most prominent of which is termed the N100 (NIOOm in MEG). The deviants typically 
elicit a slight increment in the ERF/ERP at and after the latency of the N100, and the residue between the responses to the deviant and the standard, 
obtained via subtraction, is labeled the MMN. 

It has been argued that the MMN cannot simply be explained through response recovery whereby non-adapted or non-refractory afferent neurons 
tuned to the deviant produce a larger N100 response than the adapted neurons tuned to the standard. The main arguments for this separation between 
the N100 and MMN arise, firstly, from equivalent current dipole (ECD) modeling which indicates that these responses have different generator 
locations [Alho, 1995]. Secondly, the MMN occurs later than the N100, and must therefore be produced by dedicated neural mechanisms [Naatanen 
& Alho, 1995]. Thirdly, as the MMN is elicited by stimulus omissions it cannot be due to the activity of afferent neurons, because these cannot be 
activated by a physically absent stimulus [Yabe et al, 1998]. The “fresh afferents” (i.e., the N100) explanation having thus been discarded, no models 
of the MMN grounded in physiological results have been sought, which is surprising considering the many cognitive and pathological significances 
accorded to the MMN. Here, we reconsider the arguments for treating the N100 and MMN as separate components in light of our recent neural 
modeling work combining animal model results and MEG/EEG measurements [May, 1999] [May et al, 1999] [May & Tiitinen, 2001]. 

METHODS 

MMN source location : In line with results from primate models showing the relative locations of auditory areas differently sensitive to tone 
frequency [Merzenich & Brugge, 1973], we used ECDs to describe the activity of an anteriorly located, tonotopically organized Area 1 with nerve 
cells sharply tuned to tone frequency and a posterior, non-tonotopical Area 2 comprising cells with broad tuning curves. These areas (displacement = 
1 cm) were assumed to include adaptation mechanisms, whereby repeated presentation of a stimulus results in diminished responses by cells tuned to 
that stimulus. Consequently, the standard stimulus activates weakly both Area 1 and 2 (5 nAm for each ECD). However, because of the differences in 
tuning sharpness, the deviant activates non-adapted and adapted cells of Area 1 and 2, respectively, which respond strongly (30 nAm) and weakly (5 
nam), respectively, to the deviant. In a spherical head model, we solved the forward problems for the responses to the standard and the deviant, and 
then fitted an unconstrained ECD to the “N100” response to the standard and the “MMN” derived through subtracting the N100 from the response to 
the deviant. Source locations between the MMN and N100 were then compared. 

MMN latency : The response dynamics of Area 1 and 2 were described through a simplified description of the canonical cortical microcircuit 
including recurrent excitation and feedback inhibition (for a thorough exposition, see [May et al, 1999]): 

™ e 0) = -u c (t) + w cc a(t)g[u c (t)] + w el g[M,(0] + k(t) + I(t), ™,(0 = -Uj(0 + w ic a(t)g[u c (t)], 

where u Q and u\ are the average membrane potentials of the subpopulation of excitatory (pyramidal) and inhibitory (stellate) cells, respectively, x = 80 
ms is the membrane time constant, w ee = 2.0 is the synaptic weights for recurrent excitation, w ei = w ie =1.7 relay feedback inhibition, / is the afferent 
input (100 ms duration, 0.005 amplitude), g = 1 \u]u determines the firing rate (1[.] is the unit step function), a is the firing rate coefficient accounting 
for spike-frequency adaptation (affecting pyramidal cells only), and k is the term describing lateral inhibition from neighboring microcircuits. The 
dynamic terms a and k were assumed to evolve on a much slower time scale than the membrane potentials, and were therefore approximated as 
quasistatic. The MEG (1-20 Hz pass-band filtered) was assumed to be directly proportional to the magnitude of feedback excitation, and the 
contribution from Area 2 was weighted by 2.0 (to reflect the larger number of responding cells). The values of a and k were used to describe the 
effect of lateral inhibition on the responses in Area 1 and the uniform adaptation of all cells in Area 2. For the response to the standard, Area 1 and 2 
were both assigned a = 0.125, k = 0, and for the deviant, a = 1 ,k = -0.0028 and a = 0.125, k= 0, respectively. In addition, we obtained the response 
elicited by a ”long-ISI” stimulus arriving after a long silent period when Area 1 and 2 are at “rest” (a= 1, k = 0, for both). 

MMN to stimulus omission : We used modeling techniques and MEG data described in [May & Tiitinen, Submitted] to gain a model of auditory 
cells tuned differentially to the frequency of amplitude modulation (AM). For parameter fitting, we used single-subject data exhibiting stimulus 
omission responses. We presented 50-ms stimuli to the model at an ISI of 104 ms for 2 seconds, and observed the effect of stopping the stimulation. 
For both experimental and simulation data, 100 ms pre-stimulus/omission baselines and 1-20 Hz pass-band filtering were used. 
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RESULTS 


MMN source location : The standard and the deviant activated the same brain areas. However, there 
was a 5 mm shift between the ECD location of the N100 and the MMN (Fig. 1, top). This was of the 
same magnitude as the separation between Areas 1 and 2 containing cells with sharp and broad tuning 
curves, respectively: For the N100 response to the standard, the ECD settled in a region midway between 
the two areas, but, for the MMN to the deviant, the ECD location was weighted towards the area with the 
sharper tuning curves and stronger activation. By increasing the activation difference between the two 
areas, one obtains even larger ECD location shifts between the N100 and MMN. With the dynamics of 
auditory cortex explaining these shifts, no assumptions of a separate MMN “component” are warranted. 

MMN latency : The response elicited by the frequency deviant was delayed compared to both the 
response elicited by the standard and by the long-ISI stimulus (Fig. 1, middle). This effect can be traced 
to the dynamics of Area 1 , where lateral inhibition impinging on the neural population tuned to the 
deviant delayed the response elicited by the deviant by effectively raising the threshold for the feedback 
excitation to be triggered. Importantly, the same areas were activated by all the stimuli, and no separate 
MMN “component”, generated by ill-defined “comparison processes” was required. 

MMN to stimulus omissions : Both the model and MEG recordings show that while an unvarying 
stream of standard stimuli elicit auditory N100 responses, the ending of the stimulus stream (essentially, 
a stimulus “omission”) elicits a prominent response at the latency of the N100 (Fig. 1, bottom). On the 
basis of the model, this N 100-like response emerges because auditory areas exhibiting tuning to 
stimulation rate continue their oscillatory activity. Again, no separate MMN “component” specifically 
sensitive to changes in the auditory environment needs to be postulated, however. 

DISCUSSION 

Our modeling results clearly show that the main arguments for the MMN and N100 responses being 
produced by different neural mechanisms are made unwarranted by considering the physiology of 
auditory areas. According to our model the standard and the deviant activate the same cortical areas, and 
this already accounts for the experimentally observed source location and latency differences between the 
N100 and MMN as well as the omission MMN. In sum, our results provide a unified theoretical 
framework according to which the MMN is, essentially, an amplitude- and latency-modulated N100. 

Our results are supported by previous findings. Firstly, the suggestion originally put forward in [May, 
1999] that source location differences between the N100 and MMN are due to differentially tuned 
auditory areas has recently been verified in fMRI and MEG measurements in humans [Jaaskelainen et al, 
2004]. Secondly, the exact shape of the latency function predicted by lateral inhibition was verified in 
[May et al, 1999]. Thirdly, the omission MMN is accounted for by our model, which describes how AM- 
tuned cells contribute to the N100 [May & Tiitinen, Submitted] [May & Tiitinen, 2001]. 

We posit that MMN might be a general indicator of cortical feature maps. Although requiring further 
verification, this hypothesis would allow feature map dynamics and their reorganization to serve as an 
explanation of learning and memory effects visible as amplitude changes in the MMN - or, that is, in the 
auditory N100 response elicited by the deviant stimulus. 
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Auditory Scene Analysis and Sensory Memory: the Role of the Auditory NIOOm 

May, P. J. C. and Tiitinen, H. 

Apperception & Cortical Dynamics (ACD), Department of Psychology, University of Helsinki, Finland 


ABSTRACT 

We consider the neural dynamics underlying auditory streaming, the perceptual grouping of transient auditory events, by using neural modeling 
and magnetoencephalographic (MEG) measurements in humans. We demonstrate that spatial variations in the strength of feedback inhibition leads to 
differential amplitude modulation (AM) tuning resembling that found in animal models. In our model, neurons respond selectively to stimuli 
presented at different interstimulus intervals (ISIs), and their summed activity (corresponding to the MEG signal) exhibits both transient and 
sustained responses (SRs) at fast ISIs. In MEG measurements utilizing 2-s trains of 50-ms stimuli presented at 0 - 1950 ms ISIs, we observed the 
transient N100m and SRs predicted by the model, with a prominent SR emerging for discrete stimuli at ISIs below 200 ms. Our results explain why, 
at fast stimulus rates, the amplitude of the auditory NIOOm appears to be strongly attenuated even though auditory cortex continues to respond 
vigorously to the stimuli. The results suggest that the longer and shorter forms of auditory sensory memory may be reflected in the NIOOm and the 
SR, respectively. As the emergence of the SR coincides with the stimuli being perceived as auditory streams, our study suggests that auditory sensory 
memory as indexed by transient and sustained cortical activity might underlie auditory scene analysis. 

KEY WORDS 

Human, Auditory, Cortex, Streaming, Sensory Memory, Magnetoencephalography, MEG, Neural Modeling, NIOOm, Sustained Response, SR. 

INTRODUCTION 

In natural environments, the human auditory system is bombarded with signals overlapping in time and spectral content. The study of auditory 
scene analysis [Bregman, 1990] describes how humans deal with these signals by segregating sound sources and focusing their attention on a 
particular source. Segregation is accomplished by assigning auditory events to perceptual entities, streams, according to sound frequency, stimulation 
rate, and sound source location. For example, at very slow presentation rates (< 0.5 Hz), transient sounds are perceived as singular events, but at 
faster rates sounds are heard as part of a stream stretched out in time. Contrasting the approach of auditory scene analysis, which emphasizes the 
inclusion of individual stimuli to perceptual wholes, the study of auditory sensory memory [Cowan, 1995] emphasizes the brain and behavioral 
responses to individual stimuli. In particular, variations in these responses as a function of ISI are interpreted to be caused by a memory trace left by 
previous stimulation. For example, the presence of such a trace attenuates the NIOOm response [Lu et al, 1991]. 

As faster stimulation rates lead to streaming on the one hand and stronger sensory memory traces on the other, one may ask how streaming and 
sensory memory are related to each other. Given that sensory memory has been suggested to have a physiological index in the NIOOm response 
(N100 in EEG), this question may be worth approaching on the physiological level. Here, extending our previous neural model on stimulus rate 
sensitivity of auditory cortex [May & Tiitinen, 2001], we present preliminary results on a dynamical-systems description on how human brain 
activity is modulated by rarely and rapidly presented auditory stimuli. 

METHODS 

Simulations: The dynamics of auditory-area columns were modeled through a simplified description of the canonical cortical microcircuit: 

Tu e (t) = -u e (t)+w ee a(t)g[u e (t)]+ w ei g[wi(0]+ Kt), nij(0 = -u { (t) + w ie a(t)g[u e (t)], (1) 

where u Q and u x are the average membrane potentials of the subpopulation of excitatory and inhibitory cells, respectively, x = 80 ms is the membrane 
time constant, w ee = 2.0 is the synaptic weight for recurrent excitation, w ei and w ie relay feedback inhibition, and / is the afferent input. With the firing 
rate having a minimum -0 and a saturation value 0, g linearly transforms u to intermediate firing rates according to g(u ) = u (i.e., the resting state u Q = 
0 assumes spontaneous firing). The term a accounts for the spike-frequency adaptation of excitatory cells for which the total firing rate is given by 
a{t)g[u Q {t)\. 

Gdap^(0 — ~a{t^) J r (^low — 1)^ gl ue (t)\ + 1 U) 

where T adap = 20 ms is the onset time constant (i.e., when the r.h.s of Eq. 2 is negative), x adap = 1 s is the decay time constant (i.e., positive r.h.s.), and 
a low = 25% is the value a approaches at saturation firing rates [Liu & Wang, 2001]. The MEG was assumed to be directly proportional to the 
magnitude of feedback excitation. In simulations, 30 columns were simulated with w ie and w ei , varying linearly from 0.6 to 3.6 across the set of 
columns, which is indirectly justified by the non-uniform distributions of GABAergic cells in auditory cortex [Prieto et al, 1994]. To determine the 
temporal modulation transfer function (tMTF), each column was fed sinusoidally amplitude-modulated periodic input for 2 seconds (amplitude 0.01, 
mod. depth 100%). The tMTF was defined as the amplitude of the first harmonic of the AM frequency of the period histogram of the firing rate 
[Schreiner & Urbas, 1986]. Subsequently, eight 2-s sequences of discrete stimuli (duration 50 ms, r/f 10 ms, amplitude 0.01) and a continuous 2-s 
stimulus were presented to each column. A sequence contained 2, 4, 6, 8, 10, 15, 20, or 40 stimuli (corresponding to ISIs 1950, 650, 390, 279, 217, 
140, 104, or 50 ms, respectively). This provided a set of contributions to the MEG with undetermined magnitudes (i.e., number of columns per type) 
and onset latencies. To specify these, we used the simplex method to search for the minimum squared error between simulated and experimental data. 

Experimental methods: We presented 1000-Hz tones (intensity 70 dB SPL, duration 50 ms) to ten human subjects in the passive recording 
condition using sequences described above (inter-sequence silent period = 10 s). MEG (passband 0.0-100 Hz, sample rate 400 Hz) was recorded with 
a 122-channel whole-head gradiometer. Approximately 65 responses per stimulus sequence were averaged over a period of 3.5 sec, baseline- 
corrected with respect to a pre-sequence baseline of 500 ms and low-pass filtered at 30 Hz. Low-frequency noise was removed with the signal space 
projection method. Epochs including horizontal and vertical eye movement artefacts (voltage displacements >150 juV) were discarded. For each 
subject and stimulation condition, response amplitudes (i.e., field gradient vector sum magnitudes) were analyzed from the channel pair displaying 
the largest NIOOm responses. Responses were normalized according to the peak amplitude of the NIOOm elicited by the first stimulus in the 
sequence. Statistical testing was performed with analyses of variance (ANOVAs) and Newman-Keuls post-hoc tests. 
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RESULTS 


The cell populations in our model exhibited response properties found in the auditory cortex of animal 
models [Schreiner & Urbas, 1986]. Specifically, the tMTFs showed bandpass characteristics, with best 
modulation frequencies (BMFs) in the 0.5-12 Hz range (Fig. 1, top). BMF was an increasing function of the 
strength of feedback inhibition. When repetitive transient stimuli were presented to the model, columns with 
high BMFs responded to both fast and slow stimulation rates with oscillations, the responses to slow-rate 
stimuli consisting of large initial response followed by damped oscillations (Fig 1, middle). Crucially, the 
model predicts that columns with low BMFs respond to stimuli presented at fast rates with sustained responses 
(Fig 1, bottom). This is due to a weak local inhibition, whereby excitatory activity elicited by previous stimuli 
(i.e., their excitatory memory trace) is not extinguished during the short 1ST 

Our MEG results bear out the model predictions (Fig. 2). We found that stimuli presented at ISIs >300 ms 
elicited prominent NIOOm responses. At stimulus rates <300 ms, the stimuli elicited SRs in addition to the 
NIOOm responses. SR magnitude (= mean response in a 100-ms window preceding the onset of the last 
stimulus in the sequence) was an increasing function of stimulus rate (F[9,72] = 17.50, P < 0.001), reaching a 
maximum for stimuli presented at 100 and 50 ms ISIs. Conversely, the transient NIOOm responses decreased in 
magnitude as stimulus rate was increased (F[9,72] = 4.06, P < 0.001). Consequently, with the NIOOm peak 
amplitude and the SR approaching each other as ISI was decreased, the responses elicited by stimuli with ISIs 
below 200 ms resembled the one elicited by the continuous 2-s tone, with a prominent NIOOm response 
followed by an SR. 
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DISCUSSION 

Our neural model of auditory cortex dynamics yields similar cortical tuning to AM frequency as found in 
animal models [Schreiner & Urbas, 1986] and predicts the elicitation of SRs in the MEG even by short 
transient stimuli. Importantly, our model, successfully accounting for MEG results (Fig. 2), suggests that the 
SR and NIOOm can be generated by the same neural populations, and that separate NIOOm and SR generator 
mechanisms need not be postulated. Further, our results shed light on findings [Lu et al, 1992] whereby the 
amplitude of the NIOOm attenuates sharply as ISI is decreased, becoming practically unobservable at ISIs 
<200 ms. This waning of the auditory response would appear to be an artefact of measuring the MEG 
amplitude against a baseline of brain activity immediately preceding stimulus presentation (Fig. 2, 
bottom). Our results clearly show that this pre-stimulus activity is in fact strongly increased by 
stimuli presented at short ISIs, and therefore the NIOOm measured against such a baseline 
drastically underestimates the actual strength of cortical activitation. 

The NIOOm dependence on stimulus rate has been suggested to reflect auditory sensory 
memory [Lu et al, 1992]. Our results suggest that this dependence has two components: previous 
stimulation within a time window stretching relatively far back before stimulus presentation (>1 s) 
diminishes the peak amplitude of the NIOOm; stimulation in a short time window (<300 ms) leads 
to prominent sustained activity being present at stimulus presentation. Therefore the longer and 
shorter forms of auditory sensory memory [Cowan, 1995] may be reflected in the NIOOm and the 
SR, respectively. Further, as the emergence of the SR coincides with the stimuli being perceived as 
streams, the shorter form of auditory sensory memory might provide the necessary mechanism for 
fusing separate stimulus events into perceptual wholes: Although requiring behavioral verification, 
our tentative suggestion is that transient and sustained responses underlie the perception of sounds 
as singular events and streams, respectively. 


Figure 1. Top: AM frequency 
tuning in the model as revealed by 
the MTF for a single column (left) 
and BMFs across columns (right). 
Middle: A high-BMF column 
responds to discrete stimuli with 
oscillations. Bottom: A low-BMF 
column responds to fast stimulus 
rates with a sustained response. 
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rarely occurring stimulus is shown for reference 
(ISI = 10 s; thin curve). 


Schreiner CE, Urbas JV. Representation of amplitude modulation in the auditory cortex of the cat 
I. The anterior auditory field. Hear Res 1986;21:227-41. 


526 




















































































































P4-1 


Plasticity of NIOOm enhancement in patients with acute inner-ear hearing impairment 

T Morita 112 \ H Hiraumi 1 *, N Fujiki 3 *, Y Naito 4 ', T Nagaminc 5 ’, J ito 11 

1) Department of Otolaryngology-Head and Neck Surgery, Kyoto University Graduate School of Medicine, Japan 

2) Brain Research Unit, Low Temperature Laboratory, Helsinki University of Technology, Finland 

3) Department of Otolaryngology and Broncho-Esophagology, Otsu Red Cross Hospital, Japan 

4) Department of Otolaryngology, Kobe City General Hospital, Japan 

5) Department of Brain Pathophysiology, Human Brain Research Center, Kyoto University Graduate School of Medicine, Japan 

ABSTRACT 

We previously reported that the dipole moment for auditory N100m in patients with bilateral inner-ear hearing loss patients was larger and increased 
more rapidly as a function of stimulus intensity than that in healthy subjects, although their peripheral auditory function was injured. It is unclear, 
however, whether this enhancement can show short-term plastic alteration. In the present study, we measured NIOOm in two adult patients with 
sudden deafness at their acute and recovery phases. We presented 1-kHz pure tone burst at four different intensities (40-70 dB SPL) to the affected 
ear with a constant interstimulus interval of 1 s. AEF was recorded with a 122-channel whole-head MEG system. Both patients exhibited NIOOm in 
both hemispheres to 50, 60 and 70 dB stimuli at acute phase (AP) and to all stimuli at recovery phase (RP). In both patients, the dipole moment for 
NIOOm at acute phase was larger in response to 70 dB stimuli and showed steeper increment according to stimulus intensities compared to those at 
recovery phase. The ratio of the dipole moment for NIOOm at 70 dB to that at 50dB was 5.4/1.0/2.4/1.3 (Patient 1 AP/Patient 1 RP/Patient 2 
AP/Patient 2 RP, respectively). In our previous study, it was 3.1/1.4 (patients/healthy groups). These results show that enhanced NIOOm amplitude 
with loudness recruitment phenomenon is expressed in patients with acute inner-ear hearing loss and they can disappear in the scale of days. In 
conclusion, enhanced activity in the auditory cortex derived from inner-ear hearing loss can elicit plastic change. 

KEY WORDS 

Magnetoencephalography, Inner ear hearing loss, Sudden deafness, NIOOm, Plasticity. 

INTRODUCTION 

We previously reported that the dipole moment for auditory NIOOm in patients with bilateral inner-ear hearing loss patients was larger and 
increased more rapidly as a function of stimulus intensity than that in healthy subjects, although their peripheral auditory function was injured 
[Morita, 2003]. It is unclear, however, whether this enhancement can show short-term plastic alteration. In the present study, we measured NIOOm in 
two adult patients with sudden deafness at their acute phase (AP) and recovery phases (RP), and compared them with those of two groups 
(normals/inner-ear hearing impaired patients) in previous study. 

METHODS 

Two patients with sudden deafness were studied at three different time points. Patient 1 (62-year-old female: right ear affected) was measured at 
day 10, 24 and 107 after disease onset, and patient 2 (37-year-old male: left ear affected) at day 8, 12 and 18. Their courses of hearing thresholds at 1 
kHz were presented in table 1. Loudness recruitment, which is the character of inner-ear hearing loss, disappeared before day 24 in Pt 1 and before 
day 18 in Pt 2. Both patients gave informed consent to participate in this study. This study was approved by the Committee of Medical Ethics of the 
Graduate School of Medicine and Faculty of Medicine, Kyoto University. 

A 1-kHz pure tone (15 ms rise/fall and 170 ms plateau times) was presented monaurally at four different intensities (40, 50, 60 and70 dB SPL) 
to the affected ear. The sounds with four different intensities were presented randomly and equiprobably within a single session with a constant 
interstimulus interval of 1 s. Auditory evoked fields was recorded with a 122-channel whole-head MEG system (Neuromag Ltd, Finland). The 
recording passband was 0.03 - 130 Hz, and data were digitized at 419 Hz. The analysis period was 950 ms, from 150 ms before to 800 ms after the 
stimulus onset. The 100 ms pre-stimulus segment was used as the baseline for amplitude measurement. The vertical electro-oculogram (EOG) was 
used to exclude epochs contaminating eye movements or blinks. About one hundred epochs were averaged for each stimulus. After digital low-pass 
filtering at 40 Hz, two equivalent current dipoles (ECDs) were used to explain the magnetic field distribution. 

Both subjects underwent three-dimensional MR imaging. A single sphere model was used for the source modeling. Each subject’s sphere was 
determined from one’s own MRI data. 


RESULTS 

Both patients exhibited N100m in both hemispheres to 50, 60 and 70 dB stimuli at acute phase and to all stimuli at recovery phase (table 1). In 
both patients, the dipole moment for NIOOm in response to 70 dB stimuli at acute phase was larger than those at recovery phase especially in 
contralateral hemisphere. When comparing these moments in contralateral hemisphere to 70 dB stimuli at acute phase to those in previous study 
using probability of t distribution, there is significant differences to normal groups (p < 0.0001/< 0.0001, Patient 1/Patient 2), and not to patients 
group (p =0.8/0.8, Patient 1/Patient 2). 
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The dipole moment at acute pahse 
showed steeper increment according to 
stimulus intensities compared to those 
at recovery phase. In contralateral 
hemisphere to the stimuili, the ratio of 
the dipole moment for NIOOm at 70 
dB to that at 50dB was 5.4/1.0/2.4/1.3 
(Patient 1 AP (day 10)/Patient 1 RP 
(day 24)/Patient 2 AP (day 8)/Patient 2 
RP (day 18), respectively). In 
ipsilateral hemisphre, the ratio of the 
dipole moment for NIOOm at 70 dB to 
that at 50dB was 5.2/1.3/1.6/1.1 


Table 1. Dipole moments for NIOOm in contralateral hemisphere to the stimulation [nAm] 

Mean (± standard error of mean) moments in previous study [Morita, 2003] were presented for comparison 


Hearing threshold Intensity [dB SPL] 

at 1 kHz [dB SPL] 40 50 60 70 


Patient 1 

AP (day 10) 

45 

- 

8.3 

36.3 

44.6 


RP (day 24) 

25 

21.3 

20.1 

24.1 

19.9 


RP (day 107) 

10 

15.2 

25.7 

25.2 

26.3 

Patient 2 

AP (day 8) 

35 

- 

18.8 

24.1 

44.8 


RP (day 12) 

20 

19.9 

25.2 

24.1 

24.1 


RP (day 18) 

10 

16.7 

23.7 

29.5 

31.2 

Normals (mean±SEM) 


13.2=1=1.2 

15.3=1=1.0 

19.1=1=1.3 

20.3±1.3 

Patients (mean±SEM) 


- 

16.3=1=2.2 

32.3=1=3.7 

43.7±4.3 


(Patient 1 AP (day 10)/Patient 1 RP (day 24)/Patient 2 AP (day 8)/Patient 2 RP (day 18), respectively). In our previous study, the average (±SEM) of 
this ratio was 3.1±0.3/1.4±0.1 in contralateral hemisphre and 2.5±0.3/1.5±0.1 in ipsilateral hemisphere (patients/healthy groups, respectively). 


DISCUSSION 

These results show that enhanced NIOOm amplitude with loudness recruitment phenomenon is expressed in patients with acute inner-ear hearing 
loss and they can disappear in the scale of days. 

In animal experiments, increased amplitude of auditory cortical responses in association with cochlear injury has been reported [Popelar, 1987] 
[Syka, 1994] [Qiu, 2000] [Salvi, 2000]. These reports suggested that suppresion of inhibition at or around inferior colliculus in central auditory 
pathway following cochlear damage caused enahncement of cortical acitivation. Syka reported using rats that temporal inner-ear damage caused by 
noise exposure could make amplitude enhancement of auditory cortical responses and this enhancement in cortical responses dissapeared in the scale 
of days following recovery of hearing threshold [Syka, 2000]. This further supports our findings. The present study is the first in humans to have 
confirmed this enhancement of auditory cortical responses can show short-term plastic alteration. 

In conclusion, enhanced activity in the human auditory cortex derived form inner-ear hearing loss can elicit plastic change. 
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ABSTRACT 

To investigate the neural mechanisms underlying in speech perception, we examined the magnetic counterpart of mismatch negativity 
(MMNm) elicited by short and long vowels in Japanese and French speakers. MMNm was recorded from seven right-handed Japanese subjects (four 
male) and six right-handed French subjects (four male) by a helmet-shaped 306-channel detector array (Vectorview, Neuromag Ltd.). Stimuli were 
Japanese nonword-sounds [erepe] and [ere:pe] (vowel length contrast) vocalized by a Japanese male. Clear MMNm responses were observed 
bilaterally in the all subjects in all conditions. In the Short condition where the averaged responses to [erepe] as standard stimuli were subtracted 
from those to [erepe] as deviant ones, the MMNm strength of French subjects did not differ significantly between the both hemispheres. The MMNm 
responses of Japanese subjects were stronger significantly in the left than in the right hemisphere (p<0.02). In the Long condition where the averaged 
responses to [ere:pe] as standard stimuli from those to [ere:pe] as deviant ones, the MMNm of French subjects was elicited stronger in the left than in 
the right hemisphere, while Japanese subjects did not show significant difference between the both hemispheres. The pattern of cortical magnetic 
responses was different between Japanese and French speakers. It is thus shown that the cortical activities to speech sounds might be influenced by 
one’s native speech sounds. 

KEY WORDS 

Speech perception; Magnetic counterpart of mismatch negativity; Magnetoencephalography; Phoneme; French; Japanese; Long vowel 

INTRODUCTION 

Since phonetic organization differs between languages, adult language learners sometimes have difficulty perceiving differences among 
phonemes used to distinguish words in their target languages. For example, long vowels are not phoneme in the French phonological system, 
whereas Japanese has the phonological contrast between short and long vowels, for instance [tori] "bird" and [to:ri] "street”. Recent studies have 
explored the left-hemisphere specialization at the preattentive and automatic level of speech sound processing by using the mismatch negativity 
(MMN) event-related potential (ERP) and its magnetic counterpart (MMNm), which reflect the brain automatic auditory change-detective process. 
The aim of this study is to investigate the neural mechanisms underlying in speech perception by examining MMNm elicited by short and long 
vowels in Japanese and French speakers. 

METHODS 

Seven healthy right-handed native Japanese speakers (four males) and six healthy right-handed native French speakers (five males) participated in 
the experience. Stimuli were two different Japanese nonwords, [erepe] which consisted of vowel-consonant-vowel-consonant-vowel syllables and 
[ere:pe] (long vowel embedded in the third morae), vocalized by a Japanese male. They were binaurally delivered at intervals of 800 ms. Two 
sequences were provided as follows: 1) [erepe] as a deviant (15%), [ere:pe] as a standard (85%), 2) [ere:pe] (15%), [erepe] (85%). The evoked 
magnetic field was recorded (passband 0.1-173.6Hz, sampling rate 1001.6Hz) with 204 planer gradiometer channels of a helmet-shaped 306-channel 
detector array (Vectorview, Neuromag Ltd.). All subjects were instructed to ignore the sound stimuli and watch a silent movie. During the auditory 
stimulation MEG epochs were online averaged separately for the different types of the standard and deviant stimuli, starting from 100 ms before and 
ending 500 ms after the stimulus onset. The MMNm was obtained by subtracting the averaged responses to the standard stimuli from those to the 
deviant ones. The responses were evaluated separately for each subject by the approach of single-dipole fit. Forty-six gradiometer channels on each 
side of the helmet-shaped magnetometer were used for assessing cortical responses in the left and right hemispheres. By means Neuromag sequential 
single-dipole fitting software, the generator sources (equivalent current dipoles: ECDs) of the MMNm were estimated by 1 mm steps. The strengths 
of the ECDs obtained were compared in Wilcoxon signed rank test with factors native language (Japanese vs. French), condition ([ere:pe] vs. [erepe] 
deviant-standard contrast) and hemisphere (left vs. right). 

RESULTS 

Clear mismatch MMNm was observed bilaterally in the all subjects in the all conditions. Fig.l shows the mean ECD strength of MMNm 
estimated in the auditory cortex. In the Short condition where the averaged responses to [erepe] as standard stimuli were subtracted from those to 
[erepe] as deviant ones, the MMNm strength of French subjects did not differ significantly between the both hemispheres. The MMNm responses of 
Japanese subjects were stronger significantly in the left than in the right hemisphere (p<0.02). In the Long condition where the averaged responses to 
[ere:pe] as standard stimuli from those to [ere:pe] as deviant ones, the MMNm of French subjects was elicited stronger in the left than in the right 
hemisphere (p<0.08), while Japanese subjects did not elicit significantly different responses between the both hemispheres. 

DISCUSSION 


The pattern of cortical magnetic responses was different between Japanese and French speakers, suggesting that the cortical activities to 
speech sounds are influenced by one’s native speech sounds and the linguistic environments could change auditory cortical processing. In the case of 
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Long condition, the MMNm elicited by French subjects was stronger in the left than in the right hemisphere, which is accounted from the prosodic 
system that “rhythm accent” is positioned in the last syllable of “rhythm group” defined as a syntactic and semantic group. Thus, French speakers 
perceive “rhythm accent” in the section of long vowel embedded in the word [ere:pe]. Therefore, this faculty might facilitate detecting long vowels 
in syllable-timed sequences. On the other hand, the MMNm elicited by Japanese subjects was stronger in the left than in the right hemisphere in the 
case of the Short condition, indicatiing that the MMNm could be strongly influenced by the short vowel. 
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ABSTRACT 

Previous MEG and EEG studies reported that the amplitude of Nlm was attenuated by a preceding sound. In this study we recorded the auditory 
magnetic fields (AEF) evoked by two test sounds following preceding sounds to investigate the effects of (1) stimulus frequency, and (2) inter¬ 
stimulus interval (ISI) between a test and the preceding sound. 

The preceding sound was a comb-filtered noise (CFN), which was derived from white noise by suppressing regularly spaced frequency regions, 
with duration of 3.0 seconds. The spectral powers of CFN were maximal in the pass-band frequency regions and minimal in the stop-band regions. 
As test stimuli we adopted two complex amplitude modulated tones. One stimulus was composed of the frequencies corresponding to the pass-band 
sections of CFN, PB stimulus, and the other was composed of the stop-band frequencies of CFN, SB stimulus. The ISIs were 0.5, 1.0, and 2.0s. As a 
result, the Nlm responses to SB and PB declined as the ISIs became shorter and the Nlm decrement was significantly larger for the SB than the PB 
with ISI of 0.5s and 1.0s. In the case of PB, the receptive neurons were already activated by the CFN thus the stimulus specific adaptation would 
cause the Nlm decrement. On the contrary, the receptive neurons of SB stimulus were not activated by the preceding CFN. They would be inhibited 
by lateral connections from the receptive neurons of the CFN. This study suggested that the lateral inhibition might be a stronger inhibitory system 
than the stimulus specific adaptation. 

KEY WORDS 

MEG, human auditory cortex, lateral inhibition, stimulus specific adaptation, habituation, inter-stimulus interval, 

INTRODUCTION 


A stimulus specific adaptation of the N1 response was already reported in early EEG experiments. Butler [Butler, 1968] presented three 
intervening tone-bursts between 1000 Hz test tones of 4 to 5 s SOA, and observed the strongest decrement of the N1 response, when the test stimuli 
and the intervening tones had equal frequencies or were closely spaced. The effect has been interpreted as less interaction between different neuronal 
populations, which were activated by more different frequencies 

In addition to the stimulus specific adaptation, cortical responses may be affected by sound in spectral ranges different from the one of the test 
stimulus. Pantev et al. [Pantev, 1999] demonstrated that after prolonged stimulation with music, from which the power in a specified narrow 
frequency band was completely removed, neurons in the auditory cortex responsive to frequencies falling within the notch were strongly inhibited. 
This result was interpreted as reflecting the long lasting effect of a lateral inhibitory process, i.e. the inhibitory influence of the stimulated neurons 
onto those neighboring neurons whose characteristic frequencies were within the notched spectral range. Pantev et al. [Pantev, 2004] also 
demonstrated the Nlm decline as effect of the lateral inhibition on a shorter time frame. After exposure to comb-filtered noise (CFN) for 3.0 s the 
Nlm amplitude decrement was larger for a stimulus with spectral components corresponding to the stop-band sections of the noise compared to the 
stimulus corresponding to the pass-band sections. It was concluded, therefore, that the lateral inhibition has a more pronounced effect than the 
stimulus specific adaptation. 

In the study of Pantev et al. [Pantev, 2004] the test stimuli were presented with a silent interval of constantly 500 ms duration between the 
interfering noise. Thus the temporal dynamic of the effect was not investigated until now and it is not clear, how the Nlm decrement changes 
depending on the ISI. 

The goal of this study was to investigate the effect of the silent interval between the offset of the masker sound and the onset of the test stimulus 
(TS) on the Nlm amplitude of the AER elicited by the TS. Of particular interest were the different effects of the CFN masker on the TS consisting of 
spectral components equivalent either to the pass-band frequencies of CFN or to the stop-band frequencies. The results are expected to gain 
information about the temporal dynamics of adaptation and lateral inhibition in the central auditory system. In this study, we use the term stimulus 
specific adaptation for the Nlm decrement caused by a preceding sound that has the overlapping spectral range of the stimulus. 


METHODS 

Ten right-handed subjects (four females, 30.3±6.7 years) participated in this study. Their 
hearing thresholds were below 15 dB nHF in the frequency range from 250 to 8000 Hz. The 
subjects gave their informed consent for participation in the study, which was in accordance 
with the Declaration of Helsinki and approved by the Ethics Commission of the Baycrest 
Centre for Geriatric Care. 

The auditory stimuli were two types of complex tones and a noise signal which were 
defined in frequency domain. The noise signal was derived from digitally filtered white 
noise. Its spectrum consisted of an alternating series of pass- and stop-band sections with 
equal width of one-quarter octave and a typical suppression of 36 dB in the stop-bands. The 
noise signal was termed "comb-filtered noise" (CFN) because of its spectral shape. The test 
stimuli (TS) were composed from five spectral components each corresponding either to the 
centers of pass-band sections of the CFN and hence termed as pass-band stimuli (PB) or to 
the stop-band sections of the CFN and termed as stop-band stimuli (SB). 



0.5 0.7 1.0 1.4 2.0 2.8 

Frequency (kHz) 


Figure 1. The amplitude spectrum of the comb- 
filtered noise (CFN). 
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The TS of 500 ms duration including 12.5 ms rise and decay times were repeatedly presented with a constant SOA of 6.0 s. Between two 
consecutive TS an interfering CFN burst of 3.0 s duration occurred with an ISI of 0.5, 1.0 or 2.0 s between the offset of the CFN burst and the onset 
of the following TS. No interfering noise was presented in the control condition. 

To investigate the effect of the CFN onset response, a CFN of 140 ms was used in the comparative condition. This short duration stimulus was 
capable to evoke an onset response, however, the long lasting exposure to the CFN was avoided. 

The amplitude of the TS and the CFN-sounds were adjusted for equal loudness after subjective tests on a group of ten subjects. The stimuli were 
presented binaurally at intensity of 45 dB above individual sensation level. 

Auditory evoked magnetic fields (AEF) were acquired with a helmet-shaped 151-channel whole cortex neuro-magnetometer (OMEGA, CTF 
Systems Inc) in a quiet magnetically shielded room. In order to keep them in an alert state, a silent movie was presented during the MEG recordings. 

The magnetic fields of single equivalent current dipoles (ECD) in both hemispheres were independently approximated to the averaged magnetic 
field distribution at the latency around the maximum global field power about 100 ms after stimulus onset. The dipole locations and orientations were 
determined in a head based Carthesian coordinate system. Nlm amplitudes evoked by TS following the interfering CFN with ISI of 0.5, 1.0, and 
2.0 s were normalized with respect to Nlm amplitudes obtained in the control condition. A repeated measures ANOVA with the normalized amplitude 
as dependent variable and three within group factors (stimulus type: pass-band and stop-band; ISI: 0.5, 1.0, and 2.0 s; hemisphere: left and right) was 
applied and the p-values smaller than 0.05 were accepted as significant. 

RESULTS 

The analysis of variance applied to the normalized Nlm amplitudes obtained with 3.0-s interfering noise resulted in significant effects of the 
factors stimulus type (F(l,9)=9.67, GG corrected p=0.013) and ISI (F(2,18)=9.43, p=0.010), no effect of the hemispheres, and a significant 
interaction between the stimulus type and the ISI (F(2,18)=6.21, p=0.011). These statistical results indicated that larger Nlm decrements were 
obtained for the SB than PB stimuli and for shorter than longer ISIs. Post-hoc analyses (paried t-tests, df=9) revealed significantly larger Nlm 
amplitude reduction for the SB stimulus compared to PB stimulus at ISI of 0.5 s (t=4.25, p=0.0011), and 1.0 s (t=2.21, p=0.027), as well as a 
tendency at ISI=2.0 s (t=1.74, p=0.058). For the SB stimulus significantly smaller Nlm amplitude ratios were obtained, when the ISI was shortened 
from 2.0 to 0.5 s (t=3.4234, p=0.0038), from 2.0 to 1.0 s (t=3.9159, p=0.0017), and from 1.0 to 0.5 s (t=2.1548, p=0.030). Significantly smaller Nlm 
amplitude ratios were found for the PB responses, when the ISI was shortened from 2.0 to 0.5 s (t=2.4525, p=0.018), and from 2.0 to 1.0 s (t=4.4706, 
p=0.0008). In case of the 140 ms duration CFN, no significant Nlm decrement was observed for the different SOAs applied. 

DISCUSSION 

Since the stimulus sequence was rather complex it was worthwhile to experimentally determine which stimulus parameter effectively causes the 
observed replicated effect. In the control condition, a short CFN-sound (140 ms) elicited a similar onset Nlm response as the CFN-sound of 3.0 s 
duration. The short CFN-sound caused no decrement of the Nlm in response to the TS regardless of the various SOAs. These results imply that the 
onset response to the 3.0 s CFN-sound had no substantial effect on Nlm decrements seen following the TS, and that the exposure to the 3.0 s CFN- 
sound was itself the main cause for these decrements. 

The stimuli and masking sounds were presented at an intensity of 45 dB SL, at which the frequency specificity in the auditory system is still well 
established. Nevertheless, from the tuning curve width at the level of the auditory nerve substantial overlap between the stop-band and pass-band 
sections of the CFN has to be expected. Without further contrast enhancement in the central auditory system a strong effect as seen in the present data 
seems rather unlikely. Thus, the observed effect can be considered as evidence for a central contrast enhancement. 

The PB stimulus activates the neurons that had been previously activated by the preceding CFN. Consequently stimulus specific adaptation 
explains the main effect causing the decrement of the Nlm amplitude. In the case of the SB stimulus, frequency bands neighboring the stimulus were 
excited during exposure to the CFN. Hence, the residual effect of lateral inhibition can explain the Nlm decrement as suggested by Pantev et al. 
[Pantev, 2004]. This explanation is based on the fundamental organization of sensory systems in parallel pathways and multiple representations in 
parallel maps at subsequent stages along the sensory pathway. Inhibitory lateral connections are a common model for contrast enhancement. Spectral 
contrast induced by the CFN results in imbalance of inhibition and excitation in a layer of the neural network. Neurons best responding to frequencies 
in the pass-band section of the CFN, especially those corresponding to frequencies close to the spectral edges, receive excitatory input from the CFN 
and lack inhibitory input from neighbored stop-band frequencies. Vice versa, neurons responding to the stop-band frequencies do not receive 
excitatory input from the CFN, but inhibitory input from neighbored pass-band sections. This results in an enhanced representation of the spectral 
contrast introduced by the CFN. After release of the CFN burst the lateral inhibition decays with a certain time-constant. Thus, the excitation caused 
by the following SB stimulus competes with the remaining inhibition and consequently the Nlm response is reduced as compared to the Nlm 
response without preceding CFN. Until now, the time-course of the Nlm decrement induced by lateral inhibition was not demonstrated. 
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ABSTRACT 

Down syndrome (DS) is characterized by mental deterioration and dementia at least partly attributed to cholinergic damage. We investigated 
with a whole-head MEG whether DS shows delayed auditory processing similar to that observed in Alzheimer’s disease (AD). MEG measurements 
were performed in ten patients with DS, and eight healthy controls. Auditory tones were monaurally presented to all subjects, and the auditory 
evoked fields (AEF) were measured with 306-channel MEG-system. Patients with DS had significantly delayed P50m and NIOOm responses over 
both hemispheres. The NIOOm response was decreased in amplitude contralaterally only, whereas there were no difference of the P50m amplitudes 
between the groups. Present results indicate delayed cortical auditory processing underlying stimulus detection in DS. Based on differences in AEFs 
amplitudes, our results suggest that P50m and NIOOm generators have partly different sensitivity to Down pathology. Given that cholinergic system 
modulates P50m and NIOOm responses, cholinergic damage in DS could contribute to impaired auditory processing. 

KEY WORDS 

Down syndrome, EEG, MEG, Auditory evoked fields, Event related potentials 

INTRODUCTION 

DS is usually caused by an extra chromosome 21, and is characterized by delayed neurodevelopment during childhood and mental 
deterioration during middle age. The typical developmental stage of a Down patient corresponds to that of a four to seven year-old child. DS patients 
have retarded physical and mental development. Interestingly, brain pathology in DS has several similarities with that of AD, such as accumulation of 
senile plaques and neurofibrillary tangles, accompanied by destruction of ascending cholinergic neurons [Kavas, 1999]. Event-related potential (ERP) 
studies in DS have revealed accelerated neural processing at the level of brainstem, and delay in subsequent cortical processing [Diaz, 1999]. EEG- 
and MEG-studies have shown impaired cortical auditory processing in AD as well [Green, 1992], [Pekkonen, 1999]. The aim of the present study 
was to investigate with a whole-head MEG whether DS patients without clear symptoms of dementia have similar functional abnormalities of 
auditory processing as observed in AD. 

METHODS 

MEG measurements were performed to ten non-demented patients with DS, and eight age- and gender-matched healthy controls. Institutional 
ethical committee approval was obtained. Monaural sinusoidal tones with intensity of 60 dB over the hearing threshold were presented to all subjects 
with interstimulus interval of 1 s. The auditory evoked fields (AEF) were measured using 306-channel MEG-system (Neuromag). The recording 
passband was 0.03-100 Hz, with a sampling rate of 397 Hz. Vertical and horizontal electro-oculograms (EOG) were recorded to exclude blinking 
artifacts. Peak latency and amplitude values of the P50m and NIOOm responses were measured from the channel pair showing the largest responses 
over each hemisphere. The MEG data were analysed using a 2-way repeated measures of ANOVA (group by hemisphere) and subsequent unpaired 
two-tailed Etests to assess possible group differences for the latencies and amplitudes of the AEFs. 

RESULTS 


Ipsilateral hemisphere Contralateral hemisphere 




Figure 1. Auditory evoked magnetic responses over each hemisphere from one patient and one control subject from the channels where responses 
were the largest. Patients with DS had significantly delayed P50m and NIOOm responses over both hemispheres, and the NIOOm response was clearly 
decreased in amplitude. 


533 















P4-1 


Auditory P50m and NIOOm responses were significantly delayed over both hemispheres in Down patients as compared with control subjects 
(Figs 1 and 2). NIOOm response was significantly attenuated contralaterally to the ear stimulated, whereas no significant differences in P50m 
amplitude were observed between the groups. 
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Figure 2. The P50m and NIOOm latencies were significantly delayed in Down patients. Right: Contralateral hemisphere; Left: Ipsilateral hemisphere. 
* p <0.05, ** p < 0.01. 


DISCUSSION 

This is the first MEG-study indicating divergent cortical auditory processing underlying stimulus detection in DS patients without clear signs 
of dementia as shown by delayed auditory P50m and NIOOm responses. Our results are also in line with previous EEG studies showing delayed 
cortical auditory processing [Diaz, 2001]. Given that only NIOOm was decreased in amplitude, our results suggest different sensitivity of P50m and 
NIOOm generators to Down pathology. Based on finding that P50m and NIOOm generators are modulated by the cholinergic system [Peklconen, 
2001], divergent auditory processing observed in DS might be linked to cholinergic deficit. 
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ABSTRACT 

40-Hz auditory steady state responses to amplitude modulated tones were recorded with magnetoencephalography from primary auditory cortex 
investigating the effect of focused attention. A modulation discrimination task and a destructive visual task established the attended and the non- 
attended experimental conditions. A strong contrast between these conditions was demonstrated by largely enhanced sustained responses under the 
attention condition. A significant attentional effect on the AS SR amplitude was observed mostly in the left hemisphere between 200 to 500 ms after 
stimulus onset. In contrast, transient gamma-band and N1 responses were not affected by the different states of attention. 

KEY WORDS 

Attention, auditory steady-state responses, sustained magnetic field, magneto-encephalography 

INTRODUCTION 

Oscillatory neuronal activity in the 40-Hz range is discussed related to feature binding and cognition. Transient 40-Hz response, which is evoked 
immediately after the auditory stimulus onset, is enhanced under active attention [Tiitinen, 1997][Debener, 2003]. Auditory steady-state response 
(ASSR) is another type of oscillatory, stimulus-locked 40-Hz activity for which a previous EEG study gave no evidence for an effect of selective 
attention [Linden, 1987]. The functional meaning of the ASSR is widely unclear. Thus, a definite answer to the question of whether the ASSR 
changes with attention, is of significant interest. Whole head magneto-encephalography (MEG) was used to record ASSRs to amplitude modulated 
(AM) tones. MEG gives a better signal-to-noise ratio than EEG; therefore, we expect more detailed information about the dependency of the ASSR 
on attention than it was feasible previously. A modulation discrimination task forced the subjects to listen actively to the auditory stimuli. The 
described experiment was part of a series of investigations including monaural and dichotic listening tasks. 

METHODS 

Twelve healthy, normal hearing, right-handed subjects (five female) between the age of 23 to 54 years participated in the study, which has been 
reviewed by the Ethics Commission of the Baycrest Centre for Geriatric Care. Auditory stimuli were 40-Hz amplitude modulated (AM) 500-Hz tone- 
bursts of 800 ms duration. Stimuli were monaurally presented to the right ear at inter-stimulus intervals between 2.5 to 3.2 s at 60 dB above sensation 
level. In the attended condition 10% of the stimuli were presented with the modulation frequency changed to 30 Hz as shown in Fig.l. Subjects 
discriminated the modulation frequency and responded with a right-hand button press to the target stimuli. In the non-attended condition the subjects 
watched a slide show and counted the number of pictures in three categories; 
landscapes, animals, or humans. Visual and auditory stimuli were presented under 
attended and non-attended conditions, which only differed in the instructions and 
the performed task. MEG was recorded with a 151-channel whole head 
magnetometer (OMEGA, CTF Systems Inc, Port Coquitlam, Canada) in a silent 
magnetically shielded room. ASSR sources were localized by approximating the 
magnetic fields of single equivalent current dipoles in each hemisphere to the 32- 
48 Hz band-pass filtered averaged field data. Sustained field (SF) and N1 sources 
were estimated after 24-Hz low-pass filtering of the data. Time-series of magnetic 
dipole moments of the ASSR and SF were calculated based on their source 
coordinates. 

RESULTS 

ASSR sources were found in more medial parts of Heschl’s gyri as compared to N1 and SF sources. Thus, ASSRs are most likely generated in the 
primary auditory cortical areas. All subjects perfonned the AM discrimination correctly with mean response latency of 950 ms (830 ms after onset of 
modulation change). Figure 2 shows the time-series of grand-averaged responses to the standard stimuli obtained from the left hemisphere (i.e., 
contra-lateral to the stimulated right ear) under the attended and non-attended conditions. SF responses were clearly enhanced in the attended 
condition, whereas the Pl-Nl-waves exhibited no differences between the conditions (Fig. 2b). In eleven out of 12 individuals, the mean SF 
amplitude in the interval between 300 and 800 ms after stimulus onset was larger when subjects attended to the auditory stimulus (Fig. 2c). Grand- 
mean SF amplitude was 60% larger in the attended than the non-attended condition. An additional increase in the SF amplitude becomes obvious in 
the latency interval of 400 to 500 ms. The ASSR time-course is characterized by an initial linearly rising slope during the first 200 ms after the AM 
onset, then an interval of enhanced ASSR amplitude around 300 ms, followed by a steady-state interval with constant amplitude, and finally a fast 
amplitude decay after stimulus offset (Fig. 2d). Enhanced ASSRs in the attended condition are obvious in the graphs of the amplitudes resulting from 
Hilbert-transfonn of the ASSR waveforms (Fig. 2e). Horizontal bars in Fig. 2e denote the time intervals of significant amplitude differences. The 
most prominent effect occurs between 200 and 500 ms after stimulus onset. Ten of twelve subjects showed larger ASSR amplitudes during this 
interval (Fig. 2f). The ASSR responses were larger in the right hemisphere, ipsi-lateral to the stimulated ear. However, the ASSR enhancement during 
auditory attention was more pronounced in the left hemisphere. Transient 40-Hz responses were more pronounced in the left hemisphere contra¬ 
lateral to stimulation, however no differences between the experimental conditions were found. 
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Fig.l: ‘Target’ and ‘standard’ stimuli with 30- and 40-Hz 
amplitude modulation, respectively. 
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DISCUSSION 

An enhanced SF presented in the current study is consistent 
with a previous report [Picton, 1978] and indicates that the 
experimental conditions were sufficient to elicit attentional effects. 

A main attentional effect was an augmented ASSR amplitude 
between 200 to 500 ms after stimulus onset. This time interval 
corresponds to the onset of the modulation frequency change and is 
likely the most relevant time interval for AM discrimination. An 
enhanced ASSR may reflect the time-course of a dynamic filter on 
a perceptual level, thus facilitating the discrimination process 
during this time interval. Interestingly, the time-course of the SF 
showed increased amplitude in the same time interval. 

The onsets of the standard and target stimuli were identical and 
thus they did not bear relevant information for the discrimination 
task. This might explain why the results of the present experiment 
showed no effect of attention on the transient gamma-band and N1 
responses. 

In the experimental procedure used by Linden et al. [Linden, 

1987], who found no evidence for an effect of attention on the 
ASSR, intensity and frequency discrimination tasks were 
performed in the attended conditions. These tasks therefore, 
required no specific attention to the stimulus rhythm eliciting the 
40-Hz ASSR. In contrast, the AM discrimination task in the present 
study required focused attention to the modulation rhythm and was 
more specific for the ASSR generation. Furthermore, the single 
channel EEG recording used by Linden et al. could not distinguish 
between cortical and sub-cortical sources of ASSR [Herdman, 

2002]. Both may be affected differently by attentional changes. The 
present MEG data demonstrated that the ASSR with origin in 
primary auditory cortex are enhanced during specific attention. 

The presented data showing larger effects of attention in the left 
than the right hemisphere cannot explain whether this is a 
preference for the left hemisphere or the hemisphere contra-lateral 
to the stimulated ear. Further investigations with left ear stimulation 
and dichotic listening are in progress. However, the hypothesis of 
left hemispheric dominance is supported by findings of Zatorre et 
al. [Zattore, 2001], who found that the core auditory cortex in the 
left hemisphere is more specialized for processing of rapid 
temporal changes than core regions in the right hemisphere, which 
are more specialized for spectral processing. 
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Fig. 2: Group averages (n=12) of response waveforms to standard stimuli 
obtained from the left hemisphere, contra-lateral to the stimulated right 
ear. a: stimulus b: source waveforms of the late-evoked responses and 
sustained field (SF), c: individual SF amplitude differences between 
attended and non-attended condition and the grand mean with error bars 
denoting the 95% confidence limits, d: ASSR waveforms, e: time-course 
of the ASSR amplitudes (Horizontal bars denote significant amplitude 
differences), f: individual and grand-mean ASSR amplitude differences. 
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ABSTRACT 

Human cortical responses to the change in spaciousness of sound were recorded with the method of magneto-encephalography (MEG). The 
phases of dichotically presented 500-Hz tones were shifted so that the sound was perceived as originating either from a point-like source centered in 
the head or from separated sources in space. The phase shift was embedded in 40-Hz amplitude modulation. Thus, the phase shift could not be 
detected from a monaural signal. The transition between ‘mono’ and ‘pseudo-stereo’ quality of the sound elicited a P1-N1-P2 response similar to the 
onset response as well as a decrement in the steady-state response. The responses were discussed as reflecting binaural processing in the central 
auditory system. 

KEY WORDS 

Spatial hearing, dichotic listening, binaural interaction, pseudo-stereo sound, auditory space perception, MEG 

INTRODUCTION 

Listening to a stereo-phone recording of music via a pair of headphones induces the illusion of a spacious sound outside the head. In contrast, a 
mono-phone sound (i.e., same sound to both ears) is perceived as originating from a point-like source centered inside the head. The illusion of a 
sound in space is induced by central processing of multiple acoustical parameters like the inter-aural time- and intensity differences and correlations 
between sounds in the left and right ears [Blauert, 1983]. A single sound can be modified to create the illusion of spatial sound by phase shifting the 
sound between both ears [Schroeder, 1961]. Such a sound is called a ‘pseudo-stereo’ sound. The aim of this study was to record cortical responses to 
a change in the stimulus quality, which could only be perceived by central binaural processing. The observed brain activity may serve as a 
complement or replacement of the binaural interaction component, which is conventionally used to indicate central auditory processing. 

METHODS 

Twelve healthy, normal hearing, right-handed subjects (seven female) of 24 to 50 
years of age participated in the study, which was reviewed by the Ethics Commission of 
the Baycrest Centre for Geriatric Care. Stimuli were 40-Hz amplitude modulated (AM) 
tone-bursts (500 Hz) of 4,000 ms duration. Stimuli were dichotically presented at an 
intensity of 60 dB above the individual sensation level through earphones connected with 
plastic tubes and silicon ear-pieces. At 2,000 ms, half the stimulus duration, a phase shift 
of 90° was introduced in the right-ear carrier signal and a phase shift of -90° in the left-ear 
carrier signal (Fig. la). Thus, signals were in phase for the first two seconds (‘mono’ 
condition) and out of phase for the last two seconds (‘pseudo-stereo’ condition). A total of 
128 stimuli were presented with an ISI of 4±0.5 s between succeeding stimuli. A second 
block contained 128 stimuli, which were out of phase for the first 2 s and in phase 
thereafter. MEG was recorded with a 151-channel whole head magnetometer (OMEGA, 

CTF Systems Inc, Port Coquitlam, Canada) in a silent magnetically shielded room with 
the subjects in seated position. Subjects passively listened to the stimuli while watching a 
soundless movie. The transient auditory evoked response (P1-N1-P2 waves) and the 
auditory steady-state response (ASSR) sources were localized by approximating the 
magnetic fields of single equivalent current dipoles in each hemisphere to the 0-24 Hz and 32-48 Hz band-pass filtered averaged data, respectively. 
Time-series of magnetic dipole moments were calculated based on the estimated source coordinates. 

RESULTS 

All subjects reported to clearly perceive the sound source centered inside the head shift to distributed sound source in space. Clearly identifiable 
transient and steady-state responses were recorded from all subjects. The grand averaged response waveforms exhibit P1-N1-P2 waves after the 
stimulus onset and the transition from mono to pseudo-stereo quality (Fig. 2). The low-pass filtered waveforms also reveal a distinct sustained field 
and an ‘off’ response as well. The N1 amplitude was about 10% larger at the onset of a pseudo-stereo than a mono stimulus (p<0.01). Additionally, 
N1 amplitude was about 10% larger for the transition from mono to pseudo-stereo as compared to the reverse transition (p<0.01). A P1-N1-P2 
response to the phase shift was superimposed on the sustained field and the N1 amplitude resembled the N1 onset response when measured with 
reference to the pre-stimulus baseline. The Pl-Nl response amplitude to the phase-shift, however is 25% smaller than onset Pl-Nl response 
amplitude. No significant differences in the P1-N1-P2 responses were found between hemispheres. In contrast, ASSR amplitude was significantly 
larger in the right hemisphere. Time-course of the 40-Hz response showed the transient response after stimulus onset, a shallow slope of the ASSR 
onset, and the almost constant amplitude in the steady-state. Immediately after the phase shift, the ASSR amplitude was noticeably reduced for 
250 ms. Fig. 3 shows the individual results of a control condition, in which the phase shift in the stimulus signal followed the same direction in both 
ears. Thus, the stimulus contained a phase shift but it did not elicit any perceived change in sound source location or neural responses to the phase 
shift. Consistent with previous results, ASSR sources were significantly more medially located than the N1 responses. This suggests that ASSR are 
generated in the primary auditory cortex and N1 responses are generated in primary and non-primary auditory cortices. Detailed results of the source 
localization using the method of synthetic aperture magnetometry were recently presented [Herdman, 2003] 
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DISCUSSION 

A phase shift of 90° as introduced in the 500 Hz tone is clearly perceived 
when presented monaurally. Such a phase shift produces a small dip in 
amplitude of less than 10 ms duration at the output of the auditory filter 
[Shailer, 1987]. However, when the phase-shift is embedded in the 25 ms 
wide trough of the amplitude modulation it cannot be sensed. The control 
condition in the present study demonstrated clearly that no evoked response 
is elicited by the phase-shift in the monaural stimulus. This indicates that the 
binaural processing of inter-aural time differences elicits the auditory 
responses. 

Almost identical P1-N1-P2 responses were recorded from both 
hemispheres with prominent N1 amplitudes. Generators of the N1 wave are 
commonly assumed to be in non-primary auditory cortical areas. Utilizing 
40Hz amplitude modulated stimuli in this study allowed us also to 
simultaneously record the AS SR, which is most likely generated in the 
primary auditory cortex. Pronounced decrease in the AS SR amplitude after 
the ‘mono’ to ‘pseudo-stereo’ transition clearly indicated that the binaural 
processing is already performed in the primary auditory cortex, which is 
consistent with the assumption that the inter-aural time differences are 
processed in the inferior colliculus. 

Another experimental observation was the larger N1 response to the 
onset of the ‘pseudo-stereo’ stimulus as compared to the onset of the ‘mono’ 
stimulus. Furthermore, N1 responses were larger for the transition from 
‘mono’ to ‘pseudo-stereo’ than for the reversed transition. Stimuli contained 
no power or spectral differences, however, the more ‘spacious’ sound may 
bear more complexity and thus it might cause larger N1 responses and ASSR 
resetting. 

The inter-aural time difference, equivalent to a phase difference, is the 
most important cue for sound localization at 500-Hz carrier frequency used 
in this study. However the phase difference cannot be discriminated in pure 
tones above 1,500 Hz [Zwislocki, 1956]. At higher frequencies the time 
(phase) differences in the sound envelope become the relevant cue for sound 
localization [Bernstein, 2001]. For this case it seems straightforward to 
create corresponding stimuli with a phase shift in the stimulus envelope 
instead of the carrier signal and record equivalent responses. 

Binaural hearing is frequently impaired in patients with central auditory 
processing disorders (CAPD). This causes, for example, severe problems in 
discriminating speech in a noisy environment. The detection of the binaural 
interaction component (BIC) is of diagnostic interest for children at risk for 
CAPD [Delb, 2003]. The BIC is a component of the auditory evoked 
brainstem responses, which is derived from sequential ABR recordings as 
differences between the binaural and the sum of the two monaural responses. 

Unfortunately, the BIC is a very small evoked potential and not always 
detectable with sufficient reliability. The auditory evoked response observed 
in the present study is in contrast a robust indicator for binaural processing 
and might serve as a complement or replacement for the BIC in clinical 
screening procedures. 
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right (R) and left (L) hemispheres are also shown. 
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ABSTRACT 

Auditory steady-state responses (ASSR) were evoked with 40-Hz amplitude modulated 500-Hz tones. An additional impulse like noise stimulus 
(2000 ±500 Hz) with spectrum clearly distinct from the one of the AM sound induced pronounced perturbations in the ASSR. The effect of the 
interfering noise was interpreted as (1) reset of the ASSR because of a sudden loss in phase coherence, (2) a decrease in signal power immediately 
after presentation of the noise impulse, and (3) a modulation of ASSR amplitude and phase resembling the time course of the ASSR onset. The time- 
course of the ASSR onset was interpreted as reflecting temporal integration over several 100 ms. The reset of the ASSR was discussed as a powerful 
mechanism, which allows for fast reaction to a short stimulus change that overcomes the disadvantage of the ASSR’s long integration time constant. 

KEY WORDS 

Auditory steady-state responses, primary auditory cortex, temporal integration, phase coherence, phase reset. 

INTRODUCTION 

Oscillatory neural activity is associated with distinct behavioural states. Common oscillations in neuronal networks are discussed as related to 
higher order sensory representation such as feature binding [Gray, 1999]. Apparently, the cortical responses evoked by transient stimuli seem to be 
another type of neural activity. However, experimental results in the auditory [Sayers, 1974] and visual modality [Makeig, 2002] gave evidence that 
the reset of spontaneous oscillatory alpha activity results in the N1 response as seen in the averaged evoked response. The auditory steady-state 
response (ASSR) is a special type of externally driven oscillatory brain activity, which can be recorded by means of EEG or MEG and shows 
maximal amplitudes when the auditory stimulus is repeated at a rate of about 40 Hz. The 200 ms lasting ASSR onset was interpreted as representing 
temporal integration in the auditory system [Ross, 2002]. However, a question arose of how the system could react to a concurrent stimulus in spite 
of the steady-state representation of an ongoing periodic sound eliciting the ASSR. It has been already demonstrated that short violation of the 
periodicity of the steady-state stimulus eliciting the ASSR resulted in a sudden reset of the ASSR followed by a new integration period [Ross, 2004]. 
Therefore, the goal of this study was to show whether in general a concurrent stimulus would reset the ASSR evoked by the AM sound. Perturbations 
of the amplitude and latency induced by an interfering stimulus have been previously reported from EEG studies [Makeig, 1989][Rohrbaugh, 1990]. 

METHODS 

Twelve healthy, normal hearing, right-handed subjects (five female) in the age of 21 to 48 years participated in the study which was reviewed by 
the Ethics Commission of the Baycrest Centre for Geriatric Care. The stimuli were 40-Hz amplitude modulated (AM) tone-bursts (500 Hz) of 800 ms 
duration. A brief impulse of band-pass noise (2.000 ±500 Hz) with half-intensity duration of 5 ms was added at 400 ms after the stimulus onset 
(Fig. la). The stimuli were presented binaurally with an intensity of 60 dB above the individual sensation level. The threshold for the noise impulse 
was about 8 dB higher than that for the AM tone. 1024 stimuli were presented with an ISI of 200 ms. The MEG was recorded with a 151-channel 
whole head magnetometer (OMEGA, CTF Systems Inc, Port Coquitlam, Canada) in a magnetically shielded silent room with the subjects in seated 
position. The subjects were passively listening to the stimuli while watching a soundless movie. Sources of the ASSR were localized by 
approximating the magnetic fields of single equivalent current dipoles in each hemisphere to the 32-48 Hz band-pass filtered averaged data. Time- 
series of magnetic dipole moment were calculated based on the estimated source coordinates. The transient P1-N-P2 waves were extracted by low- 
pass (0-24 Hz) and the ASSR by band-pass-filtering (24-60 Hz). The phase coherence (Rayleigh statistic) was obtained from the coefficients of a 
short time Fourier transform (256 pts. FFT, Kaiser window, 50 ms half-intensity length). 

RESULTS 

An overview of the grand averaged response waveforms (cortical strength, nAm) is given in Fig. lb-i. The response to an AM burst without 
interfering noise impulse exhibits the P1-N1-P2 waves after stimulus onset and the sustained field, which corresponds to the stimulus duration. An 
additional transient PI-wave was evoked by the noise impulse (Fig. lc). Three distinct intervals characterize the time-course of the ASSR (Fig. Id). 
During the first 200 ms the ASSR develops with linearly rising amplitude. The constant amplitude between 200 ms after stimulus onset and the end 
of the stimulus characterizes the steady-state interval. After the stimulus termination the ASSR decayed within less than 50 ms. The ASSR amplitude 
shows a sudden decrease after occurrence of the interfering noise impulse and then it rebuilds within 200 ms (Fig. lg). Characteristic for the ASSR 
onset is the continuous change in the phase difference between the ASSR and the AM stimulus that is equivalent to a latency increase of 4 ms 
(Fig. lh). A phase deviation was observed for the ASSR after the occurrence of the noise impulse at 400ms (Fig. li). The time-frequency analysis of 
the dipole source waveform (Fig. In) reveals a high degree of phase coherence for the 40-Hz ASSR and the transient responses in gamma-band and 
the low frequencies after stimulus onset followed by a decrease in phase coherence in the gamma-band at 440-520ms after the noise impulse. The 
corresponding phase histograms in Fig. lk-m demonstrate uniformly distributed phase angles before stimulus onset, highly ordered phase during the 
ASSR, and uniformly distributed phase angles after the interfering noise. Accompanying graphs of the averaged spectral power exhibit the enhanced 
40-Hz activity during the ASSR interval. However after the noise impulse the power spectrum resembled the one of the pre-stimulus interval. 

DISCUSSION 

A noise impulse with an acoustical spectrum clearly different from that of the AM stimulus induced a pronounced perturbation of 40-Hz ASSR. 
Only a minor interaction between both stimuli in the auditory periphery was assumed due to the distinct differences of their acoustical spectra. 
Because the origin of the ASSR equivalent source has been localized in the primary auditory cortex, it is possible that the origin of the observed 
interaction might be located also the primary or even in lower auditory structures. 
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Figure 1 a: auditory stimulus, b: 0-24 Hz filtered response an AM burst stimulus, c: 0-24 Hz filtered response an AM burst stimulus with interfering 
noise impulse, d: time-course of the ASSR, e: time-course of ASSR with interfering noise impulse, f-g: ASSR peak amplitudes, h-I: latency 
difference between ASSR peaks and stimulus signal (adjusted to zero between 400 and 800 ms), k-m: phase histograms of 40-Hz responses at -80, 
240, and 480 ms, n: contour plot of phase-coherence (step size 0.1) in time-frequency domain, o: spectral power of the dipole at -80, 240, and 480 
ms. 


Several features of the obtained experimental data suggest that the induced perturbation is a reset of the ASSR. (1) The time-course of decline of 
amplitude and phase resemble that of the onset. (2) Even the deterministic stimulus resulted in a loss of phase synchronization. The 200-300 ms 
lasting onset of the ASSR was interpreted as reflecting temporal integration in the auditory system. This means that the fully developed ASSR 
corresponds to a well-established representation of the AM sound in the primary auditory cortex. However, a system with long integration time 
constants is less sensitive to changes in the input signal. A reset mechanism as demonstrated with the presented data overcomes this problem. The 
observation of decreased ASSR latency after the interfering stimulus by Rohrbaugh et al. [Rohrbaugh, 1990] was interpreted as increased sensitivity 
during orienting to the stimulus. In contrast, the sudden reset of the ASSR during passive listening suggests a pre-attentive mechanism, which allows 
the detection of a concurrent stimulus in the presence of dominant sound. The obtained experimental results give further insights into the generation 
mechanism of ASSR, which indicate more likely an additional oscillatory activity than synchronization of an ongoing spontaneous 40-Hz activity. 
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ABSTRACT 

We examined the effects of spectral complexity of tones on the Nl/Nlm and P2/P2m late auditory evoked potentials and fields in musicians (n = 
8) and nonmusicians (n = 8). Three of the stimuli were variants of the C4 piano tone. All had the same temporal envelope but varied in spectral 
content. The first tone had the C4 fundamental only, the second had the C4 fundamental and the first two harmonics, and the third had the C4 
fundamental and the first 9 harmonics. The fourth tone was a pure tone matched to the fundamental frequency of the C4 piano tone. Tones were 
presented at 60 dB above threshold measured for each subject and stimulus prior to testing. Musicians exhibited larger P2/P2m responses than 
nonmusicians for all tones. P2/P2m amplitude also increased with the spectral complexity of the tones in both groups. Nl/Nlm amplitude tended to 
be larger in musicians than nonmusicians but was not affected by spectral complexity. The results corroborate previous findings that P2 is enhanced 
in musicians [Shahin, 2003] and suggest that P2 is implicated in spectral processing. 

KEY WORDS 

Auditory Evoked Fields, Auditory Evoked Potentials, EEG, MEG, Spectral complexity 

INTRODUCTION 

The Nlm (magnetic counterpart of the Nl) and P2 components of the late auditory evoked response are neuroplastic. They have been shown to 
increase in amplitude following frequency discrimination training [Menning, 2000] [Bosnyak, 2004] [Tremblay, 2001] and to be augmented in 
musicians compared to nonmusicians [Pantev, 1998] [Shahin, 2003]. The purpose of this study was to investigate the mechanisms underlying 
modification of the Nl/Nlm and P2/P2m by experience. Animal data indicate that remodelling of auditory cortical representations is gated by 
forebrain neuromodulators which make neurons more sensitive to their afferent inputs [Freund, 1992] [Buonomano, 1998]. Under these conditions 
the cortex appears to encode its experience by Hebbian mechanisms operating within a competitive network. If N1 and P2 enhancements in 
musicians depend on this process, we would expect A2 neurons underlying the Nl/Nlm and P2/P2m to encode the specific spectral and temporal 
features of acoustic stimuli, and that enhancement of the Nlm and P2 should depend on the processed features being present in the input. Here we 
tested this hypothesis. Musicians trained on the piano listened to three piano tones which were manipulated to differ in their spectral complexity 
while the temporal envelope of each tone was held constant. Comparison among the tones indicated whether Nl/Nlm and P2/P2m enhancements 
reported previously in musicians depend on the extent to which the harmonic content of musical sounds is preserved. 

METHODS 

Subjects: We investigated 8 amateur musicians and 8 nonmusicians aged 27.9 ± 7.25 and 28.9 ± 4.34 years respectively. All musician subjects 
reported that they played the piano; 6 musicians identified the piano as their principal instrument. Musicians were recruited from the University of 
Toronto or affiliated institutes and reported practicing their principal instrument an average of 9.25 ±5.1 hours a week. Subjects gave written consent 
approved by the Research Ethics Committee of Baycrest Centre for Geriatric Care. 

Stimuli: There were four 500-ms tones: (1) a C4 piano tone consisting of the fundamental (262 Hz) and nine harmonics (called Piano); (2) a 
piano tone consisting of the C4 fundamental and the first two harmonics (called Piano2); (3) a piano tone consisting only of the C4 fundamental 
(called PianoO); and (4) a pure tone at the C4 fundamental (called pure). Piano, Piano2, and PianoO tones were matched for temporal envelope 
including an onset slope resembling that of the pure tone. 

Procedure: MEG and EEG were recorded using a combined 151 whole-head magnetometer and a 32 channel EEG acquisition system (CTF 
Systems, Port Coquitlam, Canada). For the EEG a 32-electrode cap configured according to the 10-20 system was used with Cz as a reference and an 
electrode placed at the collar bone as a ground. Impedances were reduced below 10 k-Ohms for all electrodes prior to the experiment. EEG and 
MEG were sampled at 312.5 Hz from dc to 100 Hz. The experimental session took place in an electrically and magnetically shielded room. Tones 
were presented through a plastic tube at a sound intensity 60 dB above individual threshold values for each ear, with an interstimulus interval (offset 
to onset) uniformly randomized between 3 to 4 seconds. Subjects watched a silent movie during the session and were monitored for movements. 

Data analysis: EEG and MEG were analyzed using BESA 2000 (MEGIS Software, Grafelfmg, Germany). The continuous data were digitally 
filtered between 0.1 and 20 Hz and separated according to stimulus type into epochs of 500 ms duration including a 100 ms pre-stimulus baseline. 
Trials containing shifts of ± 200 pV in EEG and/or ± 2000 fT in MEG or greater in any channel were rejected. Accepted trials (mean 86%) were 
averaged according to stimulus type (pure, PianoO, Piano2 and Piano). For source analysis two symmetrical regional sources [Scherg, 1985] were 
fitted, one in each hemisphere, around the latency of Nl/Nlm and P2/P2m peaks determined from the global field power. Source analysis was 
carried out for each subject and tone type separately, but no significant effects of tones or group were found for source locations. Accordingly the 
spatial coordinates of the sources were averaged first over tones within each subject and then collapsed across all subjects to give a source model for 
the musician and nonmusician groups combined. This source model was then converted to a single dipole (one for each hemisphere) and applied as a 
spatial filter to the individual subject waveforms separately for each tone. Spatial location was fixed but orientation was allowed to vary to maximize 
goodness of fit for each subject, tone and component. Nl/Nlm peak amplitudes were determined by a Matlab routine as the maximum inflection 
occurring in the interval 85 and 110 ms after stimulus onset, and the P2/P2m peak amplitudes in the interval 160 and 200 ms. 
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RESULTS 

Fig. 1A depicts the source waveforms 
obtained for AEPs (left panel) and AEFs (right 
panel) for musicians and nonmusicians averaged 
over hemispheres and the four tones. Larger 
Nl/Nlm and P2/P2m peaks were observed in the 
musician compared to the nonmusician group, 
but significance was only reached for the P2 and 
P2m components (main effects of group P2: 

F(l,14)=20.02, p=0.0005; P2m: F(l,14)=4.61, 
p=0.05). Fig. IB shows Nl/Nlm and P2/P2m 
amplitudes for each tone type, separately for the 
musicians and nonmusicians. Tonal stimuli and 
their interaction with groups did not reach 
significance for Nl/Nlm amplitudes. However, a 
main effect of stimuli was found for P2 (F(3,42) 

= 21.09, p=0.00001) as well as an interaction of 
stimulus with group (F(3,42)=3.35, p=0.028). P2 
responses evoked by the Piano and Piano2 tones 
were larger than P2 evoked by the pure tone in 
both groups; among musicians P2 evoked by the 
Piano tone was larger than all remaining stimuli, 
reflecting a specific enhancement of P2 for the 
Piano tone in this group (p<0.02 or better, Least 
Significant Difference (LSD) test). A main effect 
of tone was also found for P2m (F(3,42)=3.1, 
p=0.037) reflecting larger responses to the Piano 
tone than for the PianoO and pure tones (p<0.02 or better, LSD test). The interaction of group with stimulus did not reach significance for P2m. 

DISCUSSION 

These findings show that the P2/P2m is larger for the spectrally-rich tones, and further, that P2 is especially enhanced in musicians compared to 
nonmusicians for the most spectrally rich tones. This finding is consistent with the hypothesis that cortical remodelling is driven by acoustic 
properties of the stimuli experienced during musical practice. It is noteworthy that P2 enhancement by spectral complexity and by musical 
background was more pronounced in the electrical than the magnetic recordings. This may indicate that radial sources may be contributing to the P2 
component which are less visible in MEG. Effects of group and stimulus did not reach significance for Nl/Nlm, although N1 responses in particular 
tended to be larger in musicians compared to nonmusicians for the Piano tone (Figure IB). It is possible that the skill level and practice histories of 
the amateur musicians of the present study were insufficient for effects to materialize for Nl/Nlm. This suggestion is consistent with earlier results 
[Bosynak, 2004] [Tremblay, 2001] which showed that laboratory discrimination training more readily augmented the P2 than the Nl. In general, the 
enhanced effects that we found for spectrally rich piano tones suggest that the specific temporal and spectral characteristics of the input affect the 
details of auditory cortical neuronal network formation. 

ACKNOWLEDGEMENTS 

The study was supported by the Canadian Institutes of Health Research (CIHR) and the International Foundation for Music Research (IFMR). 
REFERENCES 

Buonomano, D.V., Merzenich, M.M. Cortical plasticity: From synapses to maps. Ann Rev Neuroscience 1998; 21,149-186 

Bosnyak, D.J., Eaton, R.A., Roberts, L.E. Distributed auditory cortical representations are modified by training at pitch discrimination with 40-Hz 
amplitude modulated tones. Cereb. Cortex 2004 (in press). 

Freund, T.F., Meskenaite,V., y-Aminobutyric acid-containing basal forebrain neurons innervate inhibitory intemeurons in the neocortex. Proc Natl 
Acad Sci USA 1992; 89:738-742. 

Menning, H., Roberts, L.E., Pantev, C. Plastic changes in the auditory cortex induced by intensive frequency discrimination training. NeuroReport 
2000; 11,817-822. 

Pantev, C., Oostenveld, R., Engelien, A., Ross, B., Roberts, L.E., Hoke, M. Increased auditory cortical representation in musicians. Nature 1998; 392, 
811-814. 

Scherg, M., Von Cramon, D. A new interpretation of the generators of BAEP waves I V: results of spatio-temporal dipole model. Elect Clin 
Neurophysiol 1985; 62,290-299. 

Shahin A, Bosnyak DJ, Trainor L J, Roberts LE. Enhancement of Neuroplastic P2 and Nlc Auditory Evoked Potentials in Skilled Musicians. J 
Neuroscience 2003; 23:5545-52. 

Tremblay K, Kraus N, McGee T, Ponton C, Otis B (2001) Central auditory plasticity: Changes in the N1-P2 complex after speech-sound training. 
Ear & Hearing 22:79-90. 



-100 

-80 

£ -60 

V«1 

c - 2 «- 

o ■ 


Nl 


Nlm 



Pure PianoO Piano2 Piano 



P2 


Pure PianoO Piano2 Piano 


P2m 


£ 


60 

40 


20 



Pure PianoO Piano2 Piano 


Pure PianoO Piano2 Piano 


li 


100 ms 


Musicians 

Nonmusicians 


Figure 1: A. Dipole moment waveforms calculated from MEG and EEG using the source 
locations of Nl/Nlm and P2/P2m averaged across the group and stimuli. Vertical dashed 
lines indicate stimulus onset B. Group mean (n=8) of Nl/Nlm and P2/P2m amplitudes 
evoked by four stimuli for musicians and non-musicians. The error-bars denote the standard 
error of the mean. 
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ABSTRACT 

We used magnetoencephalography (MEG) to examine the brain activation induced by frequency modulated (FM) tones in a directional 
categorization task in comparison to passive listening to the FM tones (exposure condition). Two groups, “experienced” and “inexperienced” subjects 
participated in the study. We quantified the difference between the Task and the Exposure conditions during characteristic time windows, related to 
stimulus and behavioral epochs. The analysis suggests task-related shifts of activation within and between hemispheres. 

KEY WORDS 

MEG: magnetoencephalography, FM: frequency modulation, fMRI: functional magnetic resonance imaging, AC: auditory cortex. 

INTRODUCTION 

Animal studies [Ohl, 1999], [Wetzel, 1998] and a human fMRI investigation [Brechmann, in press] suggested the dominance of the right AC, for 
the directional discrimination and categorization (upward vs. downward, independent of frequency) of FM tones. The scope of this article is to 
investigate the processing of FM tones during a directional categorization task using MEG, which - unlike fMRI - is capable of recording brain 
dynamics with a temporal resolution in the millisecond range, while keeping a satisfactory spatial resolution. 

METHODS 

The acoustic stimuli consisted of 32 FM tones covering one octave - either 
increasing or decreasing in frequency - in the range 0.5-4 kHz i.e. 0.5-1 kHz, 

0.6-1.2 kHz, ... 2-4 kHz. The duration of each FM sweep was 500 ms including 
10 ms ramps in the beginning and at the end. The stimuli were generated outside 
the shielded MEG chamber and led to the subject’s ears via two ~ 6 m long 
plastic tubes (inner diameter 16 mm); the tubes gradually reduced to an ear piece 
individually adapted to the shape of the subject’s outer ear. During each 
measurement session, 224 FM tones were binaurally and randomly presented at a 
sound pressure level of ~ 96 dB at the ear (measured at a reference frequency of 1 
kHz) and at a repetition rate of 0.5 Hz. Accordingly, an entire session lasted ~ 7.5 
min and each FM tone was presented 7 times. 

The subjects who participated in this study were divided into two groups: (a) 

21 “experienced” subjects who participated in previous experiments with FM 
tones (all right-handed, mean age a ~ 26.5 yr, 14 females), (b) 10 “inexperienced” 
subjects who declared that they had never been subjected to experiments with 
such FM tones previously (9 right-handed, 1 left-handed, a ~ 29 yr, 7 females). 

The subjects participated in the following sessions which spread over several 
measurement days: 1) During the Exposure condition, a subject was only listening 
to the sequence of FM tones. 2) Cortical response to directional categorization 
was studied by employing various Task conditions. First, the subject had to 
identify the two directions of frequency modulations (upward and downward) by 
pressing a corresponding response button with the left (up) or the right (down) 
index finger. This task was complemented by two further tasks, in which the 
subject was instructed to respond either to the upward or to the downward FM 
direction only. These latter tasks will provide additional information for the 
contribution of lateralized motor areas involved in the finger movements. During 
all sessions subjects kept their eyes closed. 

Data acquisition was performed continuously using a 148-channel 
magnetometer whole-head system (Magnes 2500, Biomagnetic Technologies, San 
Diego, USA) at a sampling rate of 254 Hz and utilizing a passband of DC to 50 
Hz. Data analysis was carried out by means of Neuromag Analysis Software 
(Elekta / Neuromag Oy, Helsinki, Finland) and BESA (MEGIS Software GmbH, 

Garching, Germany). 



Measurement Measurement 

Figure 1. Response times of all repeatedly measured 
- experienced and inexperienced - subjects. The 
categorization task is performed in both FM directions. The 
stimulus duration is 500 ms. 


RESULTS and DISCUSSION 

Analysis of the experiments provided the following preliminary results. With few exceptions, the response time (mean value of button pressing) 
of all subjects exceeded the stimulus duration by about 100-200 ms. The average period over which the subjects kept the button pressed was 
relatively short (~ 250 ms) and almost constant across all subjects. These findings may be important in a future analysis as to the separation of the 
activation of the motor cortex from the evoked response signal of the auditory cortex. Figure 1 summarizes the response times of all repeatedly 
measured subjects with the directional decision task performed in both FM directions. These behavioral data showed the following characteristics: 1) 
Most repeatedly measured subjects (experienced and inexperienced) showed a significant decrease in their response latency. 2) As a general trend, 
experienced and inexperienced subjects differed in their perfonnance of the discrimination task (the number of hits, misses and false alarms). 
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Whereas most experienced subjects kept a high level of discrimination performance, 
their performance during the course of measurements. In a future analysis, we intend 
to examine whether there is a correlation between the subjects’ performance and the 
corresponding MEG signal, similar to the findings of the complementary fMRI 
study [Brechmann, in press]. 

Epochs of 2000 ms, including an interval of 200 ms as baseline, were extracted 
from the raw data for signal averaging. Preliminary analysis [Simserides, 2003] of a 
selected group of 9 channels sensitive to signals from the auditory cortex had shown 
that (a) there is a clear difference in the averaged magnetic field as a function of 
time, B(t ), between the Task and the Exposure condition, and (b) in both 
hemispheres, the auditory peak occurring around 100 ms after stimulus onset 
(NIOOm) is significantly larger in the Task condition compared to the Exposure 
condition of the same hemisphere. No significant difference in the magnitude of the 
NIOOm during the Task condition has been observed between the two hemispheres. 
Unlike in the fMRI study, where the localization of the activity in the area of 
auditory cortex is topographically precise, data acquisition in MEG is influenced by 
the fact that the underlying cortical activation may cause signal changes in a large 
number of channels spread over the sensor helmet. Nevertheless, some dynamics of 
topographical changes can already be observed in the averaged MEG data. 

In order to investigate the temporal evolution of the cortical activation, and 
quantify the differences between Task and Exposure conditions throughout one 
whole epoch of averaged signals, we used the absolute value of B{t) times the 
duration of the instant, dt, i.e. | B{t) \ dt as a measure of the magnitude of the 
instantaneous activation. Then, the activation over a time window is the integral of 
| B(t ) | dt from the beginning until the end of the window. To evaluate the difference 
between Task and Exposure and gain some insight of the relevant processes, we 
calculated - as an initial step - the difference of integrals ( Dol ), 

tb tb 


the majority of the inexperienced subjects strongly improved 


M, 


\ta,tb\ 


| dt\B(t)\ - | dt\B(t)\ 


ta , Task ta , Exposure 

across four time windows [ ta , tb] which are considered characteristic for the 
measurements: (a) the shortest interval [0, 300 ms] embraces the late auditory 
evoked potentials; (b) the second interval [0, 500 ms] extends from the start until 
the end of the stimulus; (c) the boundaries of the third integral are determined by 
each subject’s mean value of button pressing (t bp ) and button release (t br ) as well as 

the corresponding standard deviations |7b P - c>b p , fa + obr]; (d) the interval after the 

button release till the end of the epoch [fa + c>b r , end]. 

Figure 2 illustrates the Dol values for the abovementioned time windows of one 
subject. The 148 MEG channels are depicted in a two-dimensional projection of the 
sensor helmet onto a plane, with anterior (posterior) on the top (bottom) and left 
cortex to the left side. Signal distributions change as time passes. Initially (Figs. 
2a,b), in this subject, strong differences are observed in the region encompassing 
and around the left and right auditory cortex, with a somewhat different topography. 
However, at later times, there is an expansion of the activity towards the frontal 
regions as well as a relative dominance of the right auditory cortex. 

Further analysis will probably reveal processes behind these changes of regions 
of high activation. Source modeling in combination with the fMRI data of the same 
subjects will be used as a strategy. 
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Figure 2. Contour plots of the activation difference between 
Task and Exposure conditions of one subject, (a) [0, 300 ms]; 

(b) [0, 500 ms]; (c) [t b p -ob p , *br+obr]; ( d ) [>br+cbr, end]. 
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Neuromagnetic Auditory Responses of the Human Brain Evoked by Bandpass Noise 

Soeta, Y., Nakagawa, S., and Tonoike, M. 

National Institute of Advanced Industrial Science and Technology (AIST), Japan, 


ABSTRACT 

Auditory evoked magnetic fields in relation to the bandwidth of bandpass noise were examined using MEG. Pure tone and bandpass noises with a 
center frequency of 500, 1000 and 2000 Hz were used as auditory signals. The bandwidth of the auditory signal was varied to control the maximum 
peak amplitude of the autocorrelation function (ACF). Psychophysical research has revealed that the strength of pitch corresponds well to the 
maximum peak amplitude of the ACF of a sound signal. The stimuli were binaurally delivered through plastic tubes and earpieces inserted into the 
ear canals. All sound signals had the same sound pressure level set at 74 dBA. The stimulus duration was 0.5 s, including rise and fall ramps of 10 
ms. Eight volunteers with normal hearing (22-28 years) took part in this study. The auditory evoked fields were recorded using a neuromagnetometer 
in a magnetically shielded room. Combinations of a reference stimulus (pure tone) and test stimuli (bandpass noise) were presented alternately at a 
constant 2.0 s interstimulus interval and the MEGs were recorded. The results showed that the Nlm latencies were not affected by bandwidth of 
bandpass noise, however, the peak amplitude of Nlm decreased with increasing the bandwidth of bandpass noise. 

KEY WORDS 

Auditory evoked field, Nlm, Bandpass noise, Bandwidth, Autocorrelation function 

INTRODUCTION 

Some researchers have measured the threshold of a sinusoidal signal as a function of the bandwidth of a bandpass noise masker [e.g., Fletcher, 
1940; Greenwood, 1961]. The threshold of the signal increases at first as the noise bandwidth increases, but then flattens off; further increases in the 
noise bandwidth do not change the signal threshold significantly. It is called the “critical bandwidth” at which the signal threshold ceases to increase. 
To account for these results, Fletcher [1940] suggested that the peripheral auditory system behaves as if it contains a bank of bandpass filters with 
overlapping passbands, which are called the “auditory filters”. When trying to detect a signal in a noisy background, the listener is assumed to make 
use of a filter with a center frequency close to that of the signal. Increases in the noise bandwidth result in more noise passing through the auditory 
filter, providing the noise bandwidth is less than the filter bandwidth. However, once the noise bandwidth exceeds the filter bandwidth, further 
increases in the noise bandwidth do not increase the noise passing through the filter. In addition, if the bandwidth of a sound is varied keeping the 
overall intensity fixed, the loudness remains constant as long as the bandwidth is less than the critical bandwidth. If the bandwidth is increased 
beyond the critical bandwidth, the loudness increases [Zwicker et al., 1957; Greenwood, 1961; Sato et al., 2002]. 

The present study aimed at evaluating the magnetic activity of the auditory cortex against bandpass filtered noise. The bandwidth of the bandpass 
filtered noise was varied. The psychological responses to bandpass filtered noise have largely been obtained in humans, while there is little evidence 
of the cortical responses to bandpass filtered noise. The present study used MEG to examine whether or not there is an activity in the auditory cortex 
corresponding to the bandwidth variations of bandpass noise. 

METHODS 


Pure tone and bandpass noises with center frequencies of 500, 1000 or 2000 Hz were used as auditory signals. The bandwidth of the auditory 
signal was varied with a digital bandpass filter. The stimulus duration was 0.5 s, including rise and fall ramps of 10 ms. Auditory stimuli were 
binaurally delivered through plastic tubes and earpieces inserted into the ear canals. All auditory signals had the sound pressure level set at 74 dB for 
the measurement of the autocorrelation function (ACF) at zero delay, 0(0). ACF provides the same information as the power spectral density of a 
signal. The auditory signals were characterized by ACF factors, defined by the delay time of the first maximum peak, xi, and its amplitude, 4>i [Ando, 
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Figure 1 . Typical waveforms of auditory evoked magnetic fields 
from 122 channels in a subject. 


1998]. The measured 4>i increase as the filter bandwidth decreases. 

Eight volunteers (22-28 years; all right-handed) with normal hearing took 
part in this study. Informed consent was obtained from each subject after the 
nature of the study was explained, and all experimental procedures were 
conducted in accordance with the Declaration of Helsinki. The auditory 
evoked fields (AEFs) were recorded using a 122 channel magnetometer in a 
magnetically-shielded room. Combinations of a reference stimulus (pure 
tone) and test stimuli (bandpass noise) were presented alternately at a 
constant 2.0 s interstimulus interval. During recording, subjects watched a 
self-selected silent movie. The magnetic data were sampled at 0.4 kHz after 
being bandpass filtered between 0.03 and 100 Hz, and then averaged 
approximately 50 times. The magnetometer has two pick-up coils in each 
position that measure two tangential derivatives, dBz/dx and dBz/dy, of the 
field component Bz. To evaluate the amplitude and the latency of the Nlm 
peak, the root-mean-square (RMS) of dBz/dx and dBz/dy were determined as 
the amplitude of the responses at each recording position. The Nlm peak 
amplitude and latency was defined as the RMS peak and the latency in the 
latency range from 70-130 ms over the right and left hemisphere. To estimate 
the location and the strength of the underlying neural activity of the Nlm 
wave, a single equivalent current dipole (ECD) was assumed as the source of 
the magnetic field of the Nlm wave. The ECDs that best described the 
measured magnetic field at the Nlm peak latencies were found by least- 
squares fitting in a spherical volume conductor [Kaukoranta et al., 1986]. A 


545 

















P4-1 


one-dipole model was used separately for the left and right hemispheres with a subset of 
channels over each hemisphere. 

RESULTS 

Clear Nlm responses were observed in both the right and left temporal regions in all 
subjects (Fig. 1). The Nlm latencies were not systematically affected by the center frequency 
nor the bandwidth. The peak amplitude of Nlm decreased with increasing the bandwidth of 
bandpass noise. A two way ANOVA (bandwidth x hemisphere) revealed a significant main 
effect of bandwidth on the Nlm peak amplitude for all center frequencies. 

For the dipole strength, results similar to the Nlm peak amplitude results were obtained. A 
narrower bandwidth of bandpass noise provided for a larger Nlm ECD moment. Figure 2 
depicts the mean ECD moment as a function of bandwidth. A two way ANOVA (bandwidth x 
hemisphere) revealed a significant main effect of bandwidth on ECD moments for all center 
frequencies. The ECD locations did not show any systematic variation across subjects as a 
function of bandwidth in any of the center frequencies. 

DISCUSSION 

In the present study, the widest bandwidth in each of the centre frequencies was beyond the 
critical bandwidth. Previous research has indicated that the highest amplitude AEFs increase 
with increasing stimulus intensity level, however the amplitude levels off or even decrease 
slightly beyond 60 dB [Reite et al., 1982a]. The increase in bandwidth beyond the critical 
bandwidth [Zwicker et al., 1957; Greenwood, 1961; Sato et al., 2002], which causes the 
increase in loudness, caused no increase of the Nlm amplitude in the present study. This can be 
attributed to the fact that all stimuli were presented at 74 dB. 

The peak amplitude of Nlm decreased with increasing the bandwidth of bandpass noise. 
Reite et al [1982b] found that AEF amplitudes to pure tones were significantly larger than those 
to white noises for both the right and left hemispheres. Sams and Salmelin [1994] investigated 
the masking of pure tones of the 1000 and 2000 Hz effects on the human auditory cortex by 
continuous white-noise maskers with frequency notches around the tone frequencies. The 
results indicated that the Nlm amplitude becomes larger with larger notches. These results are 
consistent with the present finding. 

There is an ongoing, shifting debate over the respective roles of place and temporal 
representations in the perception of pitch. Rate-place neural models use spatial discharge rate 
patterns of tonotopically-ordered neural maps. In the temporal-place modeling, it is assumed 
that the pitch is extracted with ACF [Cariani and Delgutte, 1996; Patterson et al., 1996]. 
Psychophysical research has revealed that the strength of the pitch corresponds well to the fa of 
the auditory signal [Wightman, 1973; Yost, 1996]. To locate the mechanism for the detection of 
temporal regularity in humans, PET and fMRI experiments have been perfonned using a 
‘delay-and-add’ noise [Griffiths et al., 1998; 2002]. The result showed that the activity of the 
primary auditory cortex, cochlear nuclei, and inferior colliculi increased with the regularity, 
namely 4>i, of the sound. Such a demonstration indicates the presence of a mechanism that 
converts the temporal regularity into a local neuronal activity level to one at the cortical level. 
Therefore, sounds that have more temporal regularity could lead to more cortical activity, 
which causes an increase in the strength of the Nlm response in the present study. 
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Auditory Evoked Fields in Relation to Binaural Cross-correlation 

Soeta, Y. a , Hotehama, T. b , Nakagawa, S. a , Tonoike, M. a , and Ando, Y. b 

a National Institute of Advanced Industrial Science and Technology (AIST), Japan, 
b Graduate School of Science and Technology, Kobe University, Japan 


ABSTRACT 

The binaural cross-correlation of a dichotic stimulus is a measure of the similarity of the waveforms presented to left and right ears. The concept 
of interaural cross-correlation is a fundamental one in the study of binaural hearing, in which the width of the sound image and the binaural masking 
level difference are related to the interaural cross-correlation coefficient (IACC). The present study used MEG to examine whether or not there is an 
activity in the auditory cortex corresponding to the IACC variations of bandpass noise. To create bandpass noises, white noises were digitally filtered 
between 200 and 3000 Hz. IACC of the stimuli was controlled by mixing diotic bandpass and dichotic independent bandpass noise in appropriate 
ratios. The auditory stimuli were binaurally delivered through plastic tubes and earpieces inserted into ear canals of the nine volunteers with normal 
hearing who took part in this study. All source signals had the same sound pressure level. Auditory evoked fields were recorded using a 
neuromagnetometer in a magnetically shielded room. The results showed that the Nlm latencies were not affected by IACC, however, the peak 
amplitude of Nlm significantly decreased with increasing IACC. Therefore, sounds that are interaurally more coherent could lead to more inhibition, 
which causes the decrease of the strength of the Nlm response. 

KEY WORDS 

Auditory evoked field, Nlm, Binaural cross-correlation, Bandpass noise 

INTRODUCTION 

The concept of interaural cross-correlation is a fundamental one in the study of binaural hearing; in which the width and distance of the sound 
image changes according to the interaural cross-correlation coefficient (IACC) [Kurozumi and Ohgushi, 1983; Licklider, 1948]. The IACC is also a 
significant factor in determining the perceived horizontal direction of a sound and the degree of subjective diffuseness of a sound field [Damaske and 
Ando, 1972]. When the IACC is high, a well-defined direction is perceived. Additionally, numerous psychoacoustical studies have shown that the 
IACC is the dominant factor in the determination of the binaural masking level difference (BMLD) [e.g., Licklider, 1948; Osman et al., 1975]. The 
psychological responses to IACC have largely been obtained in humans, while the neurophysiological responses are confined to animal studies. 
Further, since neurophysiological responses to IACC has examined in the auditory brainstem [Ando et al., 1991], there is little evidence of the 
cortical responses to IACC. To obtain such response, the present study used MEG to examine whether or not there is an activity in the auditory cortex 
corresponding to the IACC variations of band-pass noise. 

METHODS 

Bandpass noises were employed for acoustic signals. To create bandpass noises, white noises were digitally filtered between 200 and 3000 Hz 
(Chebychev bandpass: order 18). The magnitude of the interaural cross-correlation function, IACC, is defined by the possible maximum interaural 
time delay, say, |x| < 1.0 ms [Ando, 1998]. The IACC of the stimuli was controlled by mixing in-phase diotic bandpass and dichotic independent 
bandpass noises in appropriate ratios [Blauert, 1983]. The frequency range of these noises was always kept the same. The auditory stimuli were 
delivered to subjects through plastic tubes and inserted earpieces at a comfortable listening level of approximately 68 dBA, adjusted separately for 
each subject. The sound pressure was measured with an ear simulator, including a microphone and a preamplifier, and an adaptor connected to the 
earpiece. The interaural level difference was set to 0 dB and there was no interaural delay between ear signals, i.e., the maximum of the IACC was 
always at x = 0. 

Nine volunteers with normal hearing (22-28 years; all right-handed) 
took part in the study. Informed consent was obtained from each subject 
after explaining to her/him the nature of the study. The experiments were 
conducted in accordance with the principles of the Declaration of Helsinki. 

Brain activities evoked by sound stimuli were recorded using a 122 channel 
whole-head DC superconducting quantum interference device 
magnetometer in a magnetically shielded room. Combinations of a reference 
stimulus (IACC = 1.0) and test stimuli (IACC = 0.85, 0.6, 0.2) were 
presented alternately at a constant 0.5 s interstimulus interval. During the 
experiment, subjects were asked to close their eyes to fully concentrate on 
the stimuli. The magnetic data were sampled at 0.4 kHz after being 
bandpass filtered between 0.03 and 100 Hz, and averaged more than 50 
times. Any responses coinciding with magnetic signals exceeding 3000 
fT/cm were rejected from further analysis. The averaged responses were 
digitally filtered between 1.0 and 30.0 Hz. The analysis time was 0.7 s from 
0.2 s prior to the stimulus onset. The average of the 0.2 s prestimulus period 
served as the baseline. The magnetometer has two pick-up coils in each 
position, which measure two tangential derivatives, dBz/dx and <3Bz/dy, of 
the field component Bz. To evaluate the amplitude and the latency of the 
response, the root-mean-square (RMS) of dBz/dx and dBz/dy were 
determined as the amplitude of the responses at each recording position. The 
Nlm peak amplitude and latency was defined as the RMS peak and the 



Figure 1. Typical waveforms of auditory evoked magnetic fields 
from 122 channels in a subject. 


547 






















P4-1 


latency in the latency range from 70-130 ms over the right and left hemisphere. In each subject, 
we employed the Nlm peak latency and amplitude with a channel that showed the maximum 
amplitude placed at each hemisphere. 

To estimate the location and the strength of the underlying neural activity of the Nlm wave, 
a single equivalent current dipole (ECD) was assumed as the source of the magnetic field of the 
Nlm wave. ECDs were found by a least square search in each hemisphere at the Nlm peak 
latencies. Calculations were based on the spherical conductor model, which takes into account 
the volume current within a sphere [Kaukoranta et al., 1986]. All ECDs with goodness-of-fit 
values exceeding 80% were used in further analyses. 

RESULTS 

Clear Nlm responses were observed in both right and left temporal regions in all subjects 
(Fig. 1). The Nlm latencies were not systematically affected by the IACC. Two way analysis of 
variance (ANOVA) did not show any significant effects of the IACC [F(2, 2) = 0.43, P > 0.05] 
or hemisphere [F(l, 2) = 0.11, P > 0.05] on Nlm latency. Basically, a smaller IACC provided a 
larger Nlm amplitude. Two way ANOVA (IACC x hemisphere) revealed a significant effect of 
the IACC on Nlm amplitude [F(2, 2) = 9.79, P < 0.001]. Variability in the left and right 
hemispheres was not significantly different [F(l, 2) = 0.60, P > 0.05]. For the dipole strength, 
results similar to the Nlm peak amplitude results were obtained. A smaller IACC provided for a 
larger Nlm ECD moment. Fig. 2 depicts the normalized mean Nlm ECD moment as a function 
of the IACC. A two way ANOVA (IACC x hemisphere) showed the significant main effects were 
the IACC [F(2, 2) = 34.42, P < 0.001] and the hemisphere [F(l, 2) = 6.56, P < 0.05]. There was 
no significant interaction [F(2, 2) = 2.99, P > 0.05]. The ECD locations did not show any 
systematic variation across subjects as a function of IACC. 

DISCUSSION 

Using auditory evoked magnetic fields, the brain activities corresponding to the IACC were 
analyzed in this study. A previous study on auditory evoked potential (AEP) investigated the 
IACC using a one-third octave bandpass-filtered noise with the center frequency of 500 Hz as a source signal [Ando et al., 1987]. The results showed 
that the peak-to-peak amplitude N1-P2 decreases with an increasing IACC. In the present study, a higher IACC provided for a significantly smaller 
amplitude of Nlm, P2m, and the peak-to-peak amplitude Nlm-P2m. A previous study on Nlm response to dichotic tones of different frequencies 
found that Nlm increased with the interaural frequency disparity [Yvert et al., 1998]. Significantly smaller AEPs or AEFs amplitudes were obtained 
with binaural stimulation, compared to those obtained with monaural contralateral stimulation; indicating some kind of interference between 
ipsilateral and contralateral pathways (‘binaural interaction’) [Pantev et al., 1986; McPherson and Starr, 1993]. It is considered that the more sounds 
differed arriving at both ears, the larger the Nlm amplitude might be. 

A starting point for modeling efforts in binaural hearing is the model of Jeffress [1948], which produces estimates of cross-correlation functions 
of its two inputs. Several theories regarding binaural systems rely on a cross-correlator to act as a comparator element for signals arriving at both the 
left and right ears [Blauert, 1983; Findemann, 1986; Osman et al., 1975]. It has been shown that the central binaural neurons perform an operation 
very similar to the cross-correlation of the inputs [Ando et al., 1991; Saberi et al., 1998; Yin et al., 1987]. These inputs are transformed from the 
actual acoustic signal by the peripheral auditory system; and these transformations are reflected in the properties of the cross-correlations. Among 
various approaches to incorporating the evaluation of two interaural cues, namely interaural level and interaural arrival time differences, into one 
consolidated model; a binaural cross-correlation model by contralateral inhibition was proposed [Findemann, 1986]. A model using an interaural 
cross-correlation analysis of the ear input signals should be able to filter out the components of the ear input signals that are interaurally coherent 
[Blauert, 1983; Findemann, 1986]. Therefore, sounds that are interaurally more coherent could lead to more binaural inhibition, which causes the 
decrease of the strength of the Nlm response. 
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Figure 2. The mean ECD moments of the 
Nlm from (O): left and (•): right 
hemispheres as a function of IACC. Error bars 
are the 95% confidence interval. The Nlm 
amplitudes and ECD moments are normalized 
within each subject with respect to the 
maximum value. The asterisks indicate 
statistical significance (**P<0.01). 
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ABSTRACT 

The purpose of this study was to investigate interaction of auditory pathway at the level of primary auditory cortex. An auditory evoked magnetic 
field has been investigated from the beginning of MEG studies. We examined auditory responses in 15 healthy subjects, 8 males and 7 females. The 
corresponding auditory evoked magnetic fields with stimuli to the right ear, left ear and both ears were recorded using a 320-channel whole-head 
MEG system developed by Kanazawa Institute of Technology. The stimuli were 1kHz short tone burst of 50ms duration with the interstimulus 
interval of about Is. The responses were averaged by 100 stimulus presentations of each measurement. We could find clear response peaks such as 
NIOOm and P200m in 14 subjects. A variety of response patterns were observed, especially in the NIOOm and it was not simple to determine 
response laterality; some differences were found with respect to previous studies. 

In this measurement, we could classify responses into three major groups using the isofield contour map and peak latency of NIOOm. The first 
group was the general type which is contra lateral dominance and slightly hyperactivity of right hemisphere in both ears’ stimulus. The 2nd group 
showed right hemisphere dominance even in right ear’s stimulus. The 3rd group showed left hemisphere dominance even in left ear’s stimulus. As a 
special case in the 2nd group, one subject showed almost no response in the left hemisphere. In this subject, the stimuli were altered to vowels and 
auditory evoked magnetic fields were measured. The results showed vowels evoked a greater left hemisphere response than pure tones. The 
explanation of individual variation required further investigations of binaural integration and interaural differentiation. 

KEY WORDS 

Magnetoencephalography; tone-burst stimulus; auditory evoked magnetic field; NIOOm; Hemispheric difference. 

INTRODUCTION 

The purpose of this study was to investigate interaction of auditory pathway at the level of primary auditory cortex. Auditory evoked magnetic 
fields have been investigated from the beginning of MEG studies. The Auditory evoked magnetic field contains the responses of P50m, NIOOm, and 
P200m that are occurred in primary auditory cortex. We discussed the NIOOm response in certain condition, which was different with previous 
studies. 

METHODS 

Neurologically normal subjects were 15, which were 
8 males and 7 females aged 23 to 50 participated in this 
study. Informed consent was obtained from each subject 
before the experiment. All subjects had no hearing 
disturbance. 

MEG signals were recorded in a magnetically 
shielded room using a 320 channel whole-head MEG 
system, which consists of 160 channels for 
magnetometers and 160 channels for 50mm base-lined 
co-axial gradiometers, developed by Kanazawa Institute 
of Technology [Higuchi 2003]. Subjects lay down on the 
bed and inserted their heads into the helmet-shaped 
sensing part. They were instructed to keep their eyes 
open, and to fix their eyes on a certain point to minimize 
their eye’s movements. 

The auditory stimuli were lKHz short tone burst of 
50ms duration with the interstimulus interval of about 
1 s, and delivered to the right ear, left ear and both ears 
respectively through plastic tubes attached to earplugs. 

The responses were filtered with 0.1-200Hz pass band 
by analog filters, sampled with 1kHz and averaged by 
100 stimulus presentations of each measurement. The 
analysis period was 100ms before to 500ms after 
stimulus onset.. 

RESULTS 

We could find clear response peaks such as NIOOm and P200m in 14 subjects. Especially in NIOOm, it revealed that there were some patterns and 
it was not simple to determine contra lateral dominancy or hyperactivity of the right auditory cortex than the left. Some results were not compatible 
with previous studies. In this measurement, we could classify them into three major groups using the isofield contour map and peak amplitude of the 
NIOOm. 



Isofield Contour Map 


400 450 

Time [msec! 



30fT/Step 


Figure. 1 An example data (both ears’ stimuli) of the 1st group. Left : Waveforms, Right: 
Isofield contour map. 



Isofield Contour Map 



Sink Source 
20fT/Step 


Figure 2. An example data (right ear’s stimuli) of the 2nd group. Left : Waveforms, 
Right: Isofield contour map. 
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The first group is that it has contra lateral dominancy and slightly hyperactivity of right hemisphere in both ears’ stimuli. In this group, NIOOm 
responses were observed dominantly in the left hemisphere by right ear’s stimuli, and dominantly in the right hemisphere by left ear’s stimuli. When 
stimuli were added to both ears, NIOOm responses were slightly dominant in the right hemisphere as shown in Figure. 1. 6 subjects were included in 
this group. 

The 2nd group is that it has right hemisphere dominancy even by right ear’s stimuli. In this group, NIOOm responses were observed dominantly in 
the right hemisphere by left ear’s stimuli. However in the left hemisphere they were not dominant even by right ear’s stimuli as shown in Figure.2. 
When the stimuli were added to both ears, NIOOm responses were dominant in the right hemisphere. So it is very clear that the right hemisphere is 
more active than the left hemisphere. 6 subjects were included in this group. 

The 3rd group is that the left hemisphere is dominant even by left ear’s stimuli. In this group, NIOOm responses were observed dominantly in the 
left hemisphere than the right hemisphere by right ear’s 
stimuli. However in the right hemisphere they were not 
dominant even by left ear’s stimuli as shown in Figure.3. 

When stimuli were added to both ears, they were 
observed dominantly in left hemisphere. So this means 
that the left hemisphere is more active than the right 
hemisphere. Two subjects were included in this group. 

As a quite special case in the 2nd group, one subject 
showed NIOOm responses hardly appear in the left 
hemisphere even by right ear’s stimuli as shown in 
Figure.4(left). The subject was 24 years female, and has 
no history of ear disease and no symptom of hard of 
hearing. The pure-tone audiometric test and the ABR test 
were normal. The equilibrium test was also normal. Furthermore, there were no any abnormal findings in brain MRI morphologically and PET 
metabolically. The dipole localization in the right hemisphere was supposed to exist in the auditory cortex in the right temporal lobe. But the dipole in 
the left hemisphere was located out of the auditory cortex as shown in Figure.4(right). 

DISCUSSION 

It has been discussing about hemisphere asymmetry by stimuli as like vowel and non-speech[Gootjes 1999] and presence of anatomical 
difference of myelination[Penhune 1996]. In the extreme case of the 2nd group, there are some speculations, which are no or less number of activate 
neurons in left hemisphere with any stimuli in N100m, due to synchronization of activated neurons in recording and orientation of dipole direction. 

Therefore it was not simple to determine contra lateral dominancy or hyperactivity of right auditory cortex than left. The explanation of 
individual variation is required further investigations of binaural integration and interaural differentiation. 
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ABSTRACT 

We studied the cortical processing of spatial stimuli by magnetoenchephalographic (MEG) measurements using broadband noise bursts presented 
from eight sound source directions in the horizontal plane. The stimuli were individually created for each subject by using three-dimensional (3D) 
sound techniques. The subjects carried out a behavioral task where their accuracy for localizing the 3D stimuli was established. We found that the 
auditory N100m response was sensitive to the sound source direction, exhibiting contralaterally more preponderant responses in both the left and the 
right hemisphere. Generally, responses were more prominent in the right hemisphere. The behavioral performance of the subjects correlated 
positively with N 100m amplitude organization, showing that the dynamics of auditory cortex predict behavioral sound detection. 

KEY WORDS 

Human, Auditory, Cortex, Spatial, Sound Localization, Magnetoenchephalography, MEG, NIOOm. 

INTRODUCTION 

The human auditory system exploits binaural localization cues, the interaural time and level differences (ITD & ILD respectively) and spectral 
cues produced by the filtering effects of the pinna, the head and the body [Blauert, 1997]. Previous brain research has addressed auditory spatial 
processing in headphone listening by employing ITD and ILD modifications whereby the sounds are experienced as originating inside the head. 
Recent developments in 3D-sound technology, however, have opened up the possibility of presenting spectrotemporally rich stimulation through 
headphones, resulting in the stimuli being perceived as originating in genuine 3D-space. 

Studies of auditory spatial processing in the human brain have rapidly gained momentum [Fujiki et al, 2002] [Pantev et al, 1998] [Teder-Salejarvi 
& Hillyard, 1998] and it has been established that right-hemispheric auditory areas are specialized in the processing of spatial information [Kaiser et 
al, 2000] [Palomaki et al, 2000, 2002] [Zatorre et al, 2002]. Further insight into spatial processing could be gained by linking brain processes to 
psychophysical measures. For this purpose, we presented individualized 3D stimuli to subjects in MEG measurements and in separate behavioral 
measurements where the subject indicated the perceived direction of the sound source. We then determined how well the NIOOm response predicted 
behavioral performance. 

METHODS 

Ten subjects participated in the experiment with informed consent. For creating individualized 3D stimuli, microphones were attached to the 
entrance of the subject's ear canals and recordings were made using a 50-ms, 10-kHz noise burst presented through loudspeakers at eight locations in 
the horizontal plane (0°, 45°, 90°, 135°, 180°, -135°, -90° and -45°, where 0°/180° = front/back & -90°/90° = left/right, respectively). In MEG 
recordings, the stimuli were delivered to the subject's ears with a tubephone sound system (100 Hz - 10 kHz, ±7.5 dB) using an interstimulus interval 
of 750 ms. Brain activity was recorded (passband 0.03-200 Hz, sampling rate 600 Hz) with a 306-channel whole-head magnetometer (Vectorview, 
Elekta Oy, Finland). The subject sat in a reclining chair under instruction not to pay attention to the auditory stimuli and to concentrate on watching a 
self-selected silent film. Over 100 instances of each direction angle were presented to each subject in a random order. The responses were averaged 
and baseline-corrected with respect to a 100-ms pre-stimulus period and bandpass filtered at 2-30 Hz. The NIOOm amplitude was analyzed from the 
gradiometer sensor pairs displaying maximum responses and its source location was estimated using unrestricted equivalent current dipoles (ECDs), 
separately above the left and right hemisphere. 

The subjects performed a behavioral localization task where responses 
to 480 random-order stimulus presentations were collected utilizing a 
computerized test program where the subject used a mouse to indicate the 
horizontal direction of the stimuli. Localization accuracy was measured in 
terms of the mean angular error, front-back confusions, and the number of 
misses. Front-back confusions, that is, where a stimulus arriving from the 
front (0°, 45°, -45°) was perceived as arriving from behind (135°, 180°, - 
135°) or vice versa , were counted and corrected. We hypothesized that the 
amplitude of the right-hemispheric NIOOm reflects behavioral sound 
localization by being an increasing monotonic function of the horizontal 
angle, varying from an ipsilateral minimum to a contralateral maximum. 

This leads to a hypothetical amplitude order of 90°, 457135°, 07180°, - 
457-135°, and -90°. For testing this hypothesis we developed an Angular 
Organization Test (AOT) which ranks the individual subject’s NIOOm 
amplitudes in terms of a distance measure: In the test, the NIOOm responses 
for the different angles are ordered according to amplitude. If this order 
exactly corresponds to the hypothesis, the distance equals to zero. If the 
order differs from the hypothesis, the ordinal distances between the 
hypothesized and observed positions are summed. This procedure yields a 
ranking-order scale ranging from 0 to 8, reflecting how well the amplitude 
of the NIOOm corresponds to the hypothesis. 


NIOOm for contra- & ipsilateral 3D stimuli 

Left hemisphere: Right hemisphere: 



Stimulus: -90° Stimulus: 90° Stimulus: -90° Stimulus: 90° 

NIOOm amplitude across stimulus direction angle 

Left hemisphere: Right hemisphere: 



Direction Angle (deg) Direction Angle (deg) 

Figure 1. Top: grand-averaged N100m responses for contra- and 
ipsilateral stimulation. Bottom: the NIOOm amplitude behavior 
for 3D stimuli from eight direction angles. The right-hemispheric 
NIOOm is more prominent and displays a larger dynamic range. 
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RESULTS 

Stimulus direction angle had a significant effect on the amplitude of both the left- (F[7,63] = 4.21, 
P < 0.001) and right-hemispheric (F[7,63] = 5.73, P < 0.001) NIOOm responses and, in both 
hemispheres, the NIOOm elicited by contralateral stimulation was larger than that elicited by ipsilateral 
stimulation (Fig. 1). Overall, the response amplitudes were larger in the right (79 fT/cm) than left (47 
fT/cm) hemisphere, with the difference approaching statistical significance (F[l,9] = 4.35, P = 0.067). 
Moreover, the interaction between hemisphere and direction angle was statistically significant (F[l,63] 
= 2.89, P < 0.05) presumably due to the larger dynamic range of the NIOOm amplitude in the right 
hemisphere. Stimulus direction angle also had a significant effect on the latency of the NIOOm in the 
right hemisphere (F[7,63] = 2.19, P < 0.05) with left-hemifield stimulation resulting in shorter response 
latencies (123 ms) than right-hemifield stimulation (129 ms). No significant effects on the NIOOm 
latency were observed in the left hemisphere (P = n.s.). Direction angle did not have significant effects 
in the ECD source locations of the NIOOm (P = n.s.). The AOT-measure, reflecting how well the 
NIOOm amplitude was organized according to stimulus direction angle, yielded an average index of 
2.90. 

Figure 2 shows the perceived direction angle as a function of the actual stimulus direction. The 
subjects were able to localize the stimuli correctly, as indicated by the linear relationship between 
stimulus direction angle and behaviorally reported angle (F[7,63] = 1157.29; P < 0.001, misses = 
1.22%, front-back confusions = 24.5%). On the average, the mean angular error was 12.5° (F[7,63] = 
3.92; P < 0.01). The angular errors for stimuli presented from the front-back (9.0°) and left-right 
directions (8.5°) were smaller than those for the stimuli presented from oblique angles (16.3°). The 
correlation coefficient between the angular error averaged over stimulus direction angle and the AOT 
test was r = 0.80 (P < 0.01), indicating that the behavioral localization ability of the subjects is reflected 
in the organization of NIOOm amplitude according to the hypothesized pattern. 

DISCUSSION 


Stimulus direction angle X behavioral report: 

-45 
-90 
•135 
180 
135 
90 
45 
0 | 

0 45 90 135 180 -135 -90 -45 

Mean angujar error for reported^direction: 

20 




90 135 180 -135 -90 -45 
Direction Angle (deg) 

Figure 2. Top: the perceived 
direction angle as a function of actual 
stimulus direction (bars indicate 
standard error of the mean). Bottom: 
mean angular error of behavioral 
responses (degrees). Note the linear 
relationship between the actual and 
perceived direction angle and 
differences between front/back and 
left/right vs. oblique directions. 


Corroborating the results of previous research [Kaiser et al, 2000] [Palomaki et al, 2000, 2002], we 
found that the right hemisphere of the human brain appears to have a dominant role in spatial 
processing. Firstly, the variation of the NIOOm amplitude across stimulus direction angle was larger in 
the right hemisphere. Presumably, a larger dynamic range of the NIOOm amplitude allows for better 
behavioral discrimination. Secondly, we found that the latency of the right-hemispheric NIOOm exhibits 

directional tuning to sound location. Finally, the statistically non-significant source location shifts in ECDs for the NIOOm responses elicited by 3D 
stimuli do not support the idea that auditory cortex uses a location code to represent 3D information. Thus, it appears that the cortex utilizes 
amplitude and latency coding for the representation of spatial infonnation. 

In assessing how brain dynamics are reflected in overt behavior, the AOT result revealed that 3D stimuli elicited right-hemispheric NIOOm 
responses with amplitudes consistently organized according to stimulus direction angle. The AOT result was further validated by the correlation 
between the AOT results and the behavioral results on mean angular error. This correlation suggests that the amplitude organization of the NIOOm 
predicts behavioral localization performance for stimuli containing spatial cues. 
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ABSTRACT 

Voiced speech is created by the fluctuating vocal folds generating the glottal pulseform. This excitation signal is the source of the speech 
fundamental frequency and its harmonic integer multiples. Periodic glottal excitation is required for the elicitation of speech-specific cortical 
processes indexed by the auditory NIOOm response. Here, we studied the cortical processing underlying the perception of the vowels /a/ and /u/ 
produced using normal and aperiodic phonation. The behavior of the NIOOm, registered with magnetoencephalography (MEG), was studied in 10 
subjects. The amplitude and latency of the NIOOm as well as the center of gravity of the activated cortical areas varied as a function of stimulus 
periodicity. Further, the presence of glottal excitation had differential effects on the latency of the NIOOm elicited by the vowels /a/ and /u/. Thus, 
changes affecting the perceptual quality of speech signals without changing their phonetic content modify the dynamics of human auditory cortex. 

KEY WORDS 

Human, Auditory, Cortex, Speech Perception, Fundamental Frequency, Vowels, Magnetoencephalography, MEG, NIOOm. 

INTRODUCTION 

The acoustic content of speech is determined by the structure and functioning of the human voice production mechanism. The length of the vocal 
folds is the primary physiological variable underlying adjustment of the speech fundamental frequency (F0) and, consequently, its harmonic integer 
multiples (2 • F0, 3 • F0 etc.). The shape and the length of the vocal tract, then again, determines the locations of the formant frequencies (FI, F2, 
etc.). Furthermore, as the length of the vocal folds is proportional to the length of the vocal tract, F0 and the positions of the formant frequencies are 
also proportional: long vocal folds, typical for males, generally imply low formant frequencies. 

By using synthetic stimulation it has been shown that the vowel /u/, which has relatively low FI and F2 values (310 & 870 Hz), activates the 
auditory cortex at a longer latency than the vowel /a/, which has higher FI and F2 values (approx. 710 & 1100Hz) [Poeppel et al, 1997]. This latency 
difference is similar to that obtained with pure tones and tone complexes [Roberts & Poeppel, 1996] [Ragot & Lepaul-Ercole, 1996] and could be 
explained by the human auditory cortex responding to frequencies in the 500 - 4000 Hz range at a relatively invariant latency while stimuli at lower 
frequencies result in delayed responses. However, in our recent study we used the vowels /a/ and /u/ with natural periodic glottal excitation and 
observed no such latency differences [Makela et al, 2003]. In a related study, we found that the vowel /a/ comprising a natural periodic glottal 
excitation structure elicited significantly larger NIOOm responses than the same phoneme with an aperiodic structure [Alku et al, 2001]. Furthermore, 
the violation of the glottal periodicity was accompanied with the cortical activation shifting to a more posterior location in both the right and the left 
hemisphere. Moreover, we recently found that the vowel /a/ and a pseudo-vowel lacking in phonetic content elicited, both with natural male and 
female FOs, shorter-latency NIOOm responses than pure tones of corresponding frequencies [Makela et al, 2002]. These observations have led us to 
propose that the presence of the periodic glottal excitation in the stimulus is a prerequisite for the elicitation of speech-specific cortical activity. 

Here, we report our preliminary results of a study aimed at clarifying the effects of the periodic glottal excitation and the F1-F2 formant 
frequencies in the cortical processing of two Finnish vowels, /a/ and /u/. We presented the subjects with two variants of each vowel, one with a 
periodic structure generated by a natural glottal excitation and the other with the aperiodic structure produced with a noise excitation. 

METHODS 

Ten right-handed, normal-hearing subjects (age 20-44 years, 5 females) participated in the study with informed consent. The subjects were 
instructed not to pay attention to auditory stimuli and to concentrate on reading a self-selected book or to watch a silent video. The stimuli were 
created by using the Semi-synthetic Speech Generation method [Alku et al, 1999], where the vowels /a/ and /u/ were synthesized using a natural, 
periodic glottal excitation produced by the vibrating vocal folds (F0 =115 Hz). The lowest four formant frequencies of the vowels were adjusted as 
follows: /a/: FI = 670 Hz, F2 = 1000 Hz, F3 = 1950 Hz and F4 = 3440 Hz, /u/: FI = 330 Hz, F2 = 580 Hz, F3 = 1900 Hz and F4 = 2900 Hz. The 
aperiodic counterparts of the vowels were produced by replacing the natural glottal excitation with a noise sequence whose spectral envelope was 
matched to that of the glottal excitation. Sound energy (computed as the squared sum of the digital time-domain signals) was equalized across the 
stimuli, and, for each subject, the sound pressure level was adjusted to a comfortable level by using the vowel /a/ as a reference stimulus (intensity 
range 70-75 dB SPL). The 200-ms stimuli were binaurally delivered to the subject through plastic tubes and ear pieces at an interstimulus interval of 
800 ms. The stimuli ( N> 150) were presented in a pseudorandom order, counterbalanced across subjects. 

Cortical activation elicited by the stimuli was registered by using a 306-channel MEG measurement device (Vectorview, Elekta Neuromag Oy, 
Finland) in a magnetically shielded room. The evoked responses were averaged over a period of 600 ms including a 100-ms pre-stimulus baseline 
and 1-30 Hz passband filtered. Epochs exceeding an absolute amplitude variation of 3000 fT/cm were excluded online, and electrodes monitoring 
horizontal and vertical eye movements were used in removing artefacts (>150 pV). The NIOOm was studied for effects in amplitude, latency, and 
source location. The waveform analysis was carried out for the field gradient vector sums from three pairs of planar gradiometers displaying 
maximum NIOOm responses, separately in each hemisphere and for each subject. Equivalent current dipoles (ECD) were fitted to a time point 
defined as the moment of the NIOOm reaching its peak amplitude in the waveform calculated over the 66 sensors located over either the left or right 
temporal brain areas. Statistical analyses were performed by using repeated measures ANOVA and Newman-Keuls post hoc -tests. 

RESULTS 

The analysis of the NIOOm waveforms revealed that glottal periodicity in stimulation had distinctly different effects on the latency of the NIOOm 
elicited by the vowels /a/ and /u/ (F[3,27] = 6.06, P < 0.01). The human auditory cortex responded to the periodic /a/ and its aperiodic counterpart at 
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the same latency (124 & 123 ms; P = n.s.). However, the latency of the NIOOm elicited by the periodic vowel /u/ was considerably shorter than the 
latency of the NIOOm elicited by its aperiodic counterpart (128 & 137 ms; P < 0.05). The NIOOm latencies for the periodic vowels /a/ and /u/ were 
similar (124 & 128 ms, P = n.s.), while the latency of the NIOOm was significantly shorter for the aperiodic /a/ than for the aperiodic /u/ (123 & 137 
ms; P < 0.01). 

The right-hemispheric NIOOm responses were more prominent 
than the left-hemispheric responses (37 & 22 fT/cm; F[l,9] = 11.10, 

P < 0.01). Moreover, the spectral content of the stimulus was 
reflected in the amplitude of the NIOOm: The periodic vowels /a/ and 
/u/ elicited larger NIOOm responses than their aperiodic counterparts 
(39 & 27 fT/cm for the periodic and aperiodic /a/, P < 0.01; 31 & 22 
fT/cm for the periodic and aperiodic /u/, P < 0.01), with also the 
amplitude difference for the periodic /a/ and /u/ and the aperiodic /a/ 
and /u/ being statistically significant (P < 0.01 & P < 0.05, 
respectively). 

The source localization results for the NIOOm were similar to 
those reported in [Alku et al, 2001] [Makela et al, 2003] with the 
sources of the NIOOm elicited by the periodic stimuli being anterior 
to those elicited by the aperiodic stimuli. 

DISCUSSION 

Here we studied the cortical processing of periodic and aperiodic 
vowels /a/ and /u/, with the former having higher FI and F2 values 
than the latter. We found that replacing the periodic glottal excitation 
with an aperiodic excitation had distinctly different effects on the temporal dynamics of cortical activation elicited by these vowels: while /a/ elicited 
the NIOOm at an invariant latency, aperiodic excitation in /u/ led to a considerably delayed N100m. Furthermore, for both /a/ and /u/, the presence of 
natural glottal excitation was reflected in the amplitude and location of the ECDs for the NIOOm, with the vowels comprising harmonic structure 
resulting in larger and more anterior responses than their aperiodic counterparts. 

The present results corroborate previous observations showing that low-frequency pure tones (<500 Hz) [Roberts et al, 1996] [Makela et al, 2002] 
elicit delayed cortical activation as reflected by the latency of the NIOOm: Here the aperiodic vowel /u/ with relatively low FI and F2 values (330 & 
580 Hz) activated the human auditory cortex considerably later than the aperiodic vowel /a/ with high FI and F2 frequencies (670 & 1000 Hz). 
However, the periodic vowels /a/ and /u/ activated the auditory cortex at an identical latency which appears to provide reasons for attributing the 
NIOOm latency effects to glottal excitation. 

The above observations can be explained in terms of the acoustical properties of the stimuli: For a low-pitched tone to be perceived as equally 
loud as a high-pitched tone, the intensity of the low-pitched one has to be amplified. Conceivably, in human speech production and perception, this 
amplification is provided by the integer multiples of the F0, which increase the sound energy at frequencies above the F0. The human perceptual 
system might be tuned to the simultaneous presence of the F0 and the F1-F2 frequencies, and, consequently, this tuning might lead to delayed cortical 
activity when the F0 information is removed. While emphasizing the link between speech production and perception [Liberman & Mattingly, 1989], 
the present results indicate that caution is advocated if results gained with other than natural speech stimulation are used for drawing inferences about 
the brain dynamics of speech perception, as brain responses are highly sensitive to minute variations in the stimulus material and the production 
methods used. 
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Figure 1. Left: The latency of the NIOOm for aperiodic and periodic vowel 
stimuli. Middle: ECD source locations for the NIOOm in the left and right 
hemisphere. Right: The amplitude of the NIOOm. Grand-averaged data, 
bars indicate standard error of the mean. 
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ABSTRACT 

The cortical areas involved in processing of emotional prosody (EP) such as joy or sadness have been reported to be localized in the basal ganglia 
of the right hemisphere in a patient study. In previous study by f-MRI, it was suggested that emotional prosodic aspects of spoken language was 
dominantly processed by the right hemisphere or the bilateral hemisphere ,which was especially associated with frontal lobes or auditory areas. 
However due to the limited temporal resolution of f-MRI, the difference of laterality between joy and sadness showing at which stage of information 
processing has not been mentioned detailed. In this paper, we investigated about interhemispheric differences of EP using magnetoencephalography 
(MEG) with excellent time resolution . 

In this experiment, a joy face (JF) or a sad face (SF) was displayed after voice which had emotional features of joy (joy prosody : JP) or sadness 
(sad prosody : SP) were presented. Subjects were requested to judge whether emotional features of the voice and the face were identical or not. 
MEGs evoked by emotional voices were measured and significant diffrences of cortical activities associated with processing of emotional feature 
were observed between the right and left hemisphere during the latency including around Nlm. It is suggested that there are interhemispheric 
differences in processing of emotional prosody . 

KEY WORDS 

magnetoencephalography (MEG); emotional prosody; f-MRI; right-hemisphere (RH); single dipole model; time course change 

INTRODUCTION 

The function of conveying the emotional meaning of feelings such as joy or sadness in spoken language is called emotional prosody (EP) 
recognition. Though this function is very important as it improves the quality of daily conversation, the mechanism of EP in the human brain has not 
been clarified. Previous researcher has shown EP to be mainly involved in the right basal ganglia, by analyzing psychological experiments or clinical 
data of aphasia or brain damage patients [ Ross E, 1981 ]. A detailed study of the cortical areas involved in EP was recently reported, utilizing 
developments of imaging research by f-MRI, with excellent space resolution. In those studies, cortical areas such as the right-temporal and a frontal 
lobe were activated when subjects read sentences or words emotionally [ Mitchell RLC, 2003 ]. Kotz et al. demonstrated the involvement of the 
bilateral temporal and frontal lobes in the processing of EP by comparing numbers of semantically read sentences and emotional read sentences. 
There are several reports using f-MRI studies that have suggested processing of EP is carried out in the right hemisphere (RH) or bilaterally. 
However the difference of cortical activities between joy prosody (JP) and sad prosody (SP) has not been mentioned clarified. In this report, we 
measured MEG to observe cortical activities in the both hemispheres associated with two kinds of EP processing JP and SP. 

METHODS 

Subjects : Subjects were 1 male and 5 females aged 21-33 years (mean 27.7 ±4.75), right handed . They had no a history of hearing difficulties 
or neurological disease. 

Stimulus : Auditory stimuli consisted of two emotional voices which expressed joy or sad mood consisting of a first name constituting three 
moras and the addition of the words “ Mr. or Mrs.” in Japanese consisting of two moras, e.g. “Ya-ma-da-sa-n “ [Sogabe Y.et al ; 2003] . The 
names were spoken by a trained speaker ( male 22.0 years) and the names used were those not so much used in daily life to avoid any possibility that 
is the response was decided by an emotional image characteristic of the individual’s experience with that name. The speaker and subjects spoke the 
same Kansai dialect. In the first experiment the subjects heard congruent or incongruent voices that were JP or SP. The sound pressure level of JP 
and SP were arranged to be identical. These voices were presented for 1200ms at random. The subjects were requested to select a drawn face that 
was presented for 1000ms and expressed joy or sadness emotions after hearing an emotional voice; the subjects then pushed a switch within 2000ms 
to decide if the face and the voice expressed congruent or incongruent emotions. We started estimating the onsets of each emotional voice during 
1500ms. 

MEG measurement and Analysis : MEGs were recorded by a 122ch whole-head neuromagnetmeter (Neuromag-122™). Evoked magnetic 
fields were sampled at 400Hz after bandpass filtered between 0.03 and 100 Hz and averaged more than 100 times for each prosody. Interaction of 
response of signals with each prosody during 100-150 ms, 150-250ms, and 250-400ms were also estimated by selected the channels which cover the 
temporal regions in both RH and LH, moreover the RMS ( root mean square ) of signals with each prosody during the latency of Nlm were 
compared ,too. We also estimate equivalent current dipoles ( ECDs ) using a single dipole model. The numbers of the dipoles localized in LH and 
RH ( goodness of fit >70% ) were compared in order to observe the interhemispheric differences of cortical activation . The RMS were calculated 
for all subjects by selected the sensors which was the same location and numbers of RH as that of LH involved in auditory area. Moreover , we 
compared the numbers of the dipoles localized in LH and RH 
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RESULTS 

The mean of the correct rate in this task was 94.1 % (±0.004 ). We 
compered the mean of the integration of responses (IOR ) of signals with each 
prosody , JP was increased significantly more than SP during 100-150ms ( p < 
0.05* JP: 406 ± 59.7 fT/cm-s SP: 288 ± 39.9 fT/cm-s ) (Fig. 1 ). Though no 
significant difference was observed during the latencies of 150-250 ms and 25- 
400 ms. Moreover we compared the mean of RMS of the latency of Nlm 
between JP and SP, the mean of the latency of JP was more significantly ealier 
than that of SP ( p<0.001** JP: 118.3 ± 2.42 ms SP: 129.8 ± 2.7 ms ) ( Fig. 3 
) and the mean of RMS during the latency of Nlm of JP was more significantly 
increased than that of SP ( p<0.001** JP: 14.8 ± 1.87 fT/cm SP: 12.1± 1.81 
fT/cm ). And then, we compared the mean of RMS of the latency of Nlm 
between LH and RH, the mean of RH was more than that of LH ( p<0.05* 

LH: 11.5 ± 4.22 fT/cm RH: 15.4 ± 7.72 fT/cm ) ( Fig. 2 ). In addition to these 
analysis, we compared the change of number of dipoles localized in between 
LH and RH within JP and SP during the latency of the 

Nlm, the number of dipoles of RH changed largely between JP and SP ( Fig. 3 ). 
The cortical activation areas of EP were no significant between JP and SP and 
between LH and RH during the latency of Nlm . 



Fig 1. The comparing of the interaction of response 
of signals between JP and SP during 100 and 150ms 
( JP>SP p < 0.05*) 



Fig 2. The comparing of RMS between LH and RH at 
the latency of Nlm ( L < R p<0.05*) 


Fig. 3 The number of dipoles estimated for JP and SP during 100 
and 150ms 


DISCUSSION 

Our results showed that the cortical activities for JP were larger than that of SP, in the latency range between 100 and 150 ms. Kotz SA. reported 
that the activation was stronger for the positive intonations than the negative. In our experiment, JP has stronger intonation than SP. It is possible 
that the differences in the cortical activity observed in this study depend on the differences of intonation. In terms of the laterality, RH was more 
activated than LH in agreement with the previous study by f-MRI. However, Imaizumi et al.(1998) reported that cortical activities associated with 
processing of pitch and intonation is dominatly carried out in RH. It is difficult to decide whether these interhemispheric differences reflect the 
differeces of pitch or EP. More precise study about the effect of acoustic feasture on the cortical activities is needed. 
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Dynamic neuromagnetic responses to auditory motion: A novel index for evaluation of 

attention 
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ABSTRACT 

Our previous studies have demonstrated that there is a motion related magnetic response evoked by only moving sound but not stationary sound. 
The motion related response reflects the dynamic activities of brain processing of auditory motion. This study focuses on the effect of attention on the 
response. Six normal hearing subjects were studied using a 151-channel whole-cortex magnetoencephalography (MEG) system. Four sound stimuli 
were designed: two sounds moved from left to right (rightward) and the other two sounds moved from right to left (leftward). Two sounds moving in 
the same direction were separated with an attention task. The attention task was a mathematical calculation. The results showed that when subjects 
listened to the moving sound, there was a clearly auditory motion related responses at a latency of 649± 14.2 ms. However, when subjects were doing 
the calculation while listening to the moving sounds, the motion related response completely disappeared. Strikingly, there was a dent at a latency of 
600 ms. To our knowledge, this is the first study showing a neuromagnetic response which can be clearly elicited or inhibited by an attention task. 
We consider that the motion related response has the potential to be an objective index for the study of attention. 

KEY WORDS 

Magnetoencephalography (MEG), auditory evoked magnetic field (AEF), auditory motion, attention, brain. 

INTRODUCTION 

Previous work using event-related potentials, MEG, and more recently neuroimaging techniques has provided fundamental information on the 
neural correlates of attention in the central cortical system. Giard and colleagues proposed an adaptive filtering mechanism for selective auditory 
attention that can be flexibly and dynamically tuned depending on the attentional demand [Giard, 2000]. Recent findings indicate that mismatch 
negativity (MMN)/mismatch magnetic fields (MMF) elicited in auditory cortex by intensity deviants are not strongly automatic but rather can be 
gated or suppressed if attention is strongly focused elsewhere [Woldorff, 1998]. Attention can modify the activity of two different areas in the 
supratemporal auditory cortex. Both attention effects as alterations of the exogenous evoked response components: the earlier effect as changed 
activity in neurons underlying NIOOm to relevant tones and the later effect as a modification of P200m to irrelevant tones [Rif, 1991]. Our previous 
study has demonstrated that there is a motion related magnetic response evoked only by moving sound but not stationary sound. Since this response 
reflects the dynamics of brain processing of sound motion, it is very interesting to investigate if attention will affect this response. Though MEG has 
been used in attention study and neuromagnetic changes have been found in attention task, however, to our knowledge, the effect of attention on 
sound motion evoked response has not been studied. 

METHODS 

Subjects: Six normal hearing adults (2 women and 4 men, aged 24-39 years, with a mean age of 30 years) were studied. All subjects were free 
of known neurological disorder, and had normal vision. Informed consent for the study was obtained from all subjects. 

Stimuli: The sound motion was produced by custom designed software using Direct 3D sound technology (Microsoft Company, Redmond), 
sound stimuli were designed: two sounds moved from left to right and the other two vice versa. The two sounds moving in the same direction were 
separated with an attention task. The attention task was 100 simple mathematic calculations. Based on our previous results 4 , sound motion 
parameters were chosen to ensure that subjects could clearly perceive sound motion. The sound strength was 80 ± 6 dB, interval time was random 
with an average 1.2 ± 0.1 s. 

MEG Recordings: A 151-channel whole cortex CTF OMEGA system was implemented in this study (CTF Systems Inc., Vancouver, Canada). 
The MEG measurements were performed in a magnetically shielded room with a total system white noise level below 10 fT/VHz. The localization of 
the subject’s head relative to the sensor array was accomplished with three small coils affixed to the nasion and pre-auricular points. The coils were 
simultaneously activated at different frequencies and their positions were determined from their magnetic signals. The system also allows for head 
localization to an accuracy of better than 2 mm. The three coils were identical to the three fiduciary points used in MRI. Data was recorded with 3rd 
order gradients noise cancellation. The sampling rate of data acquisition was 1250 Hz. One hundred epochs were averaged for one session. The 
analysis window was 0.2 s before and 1.2 s after the stimuli, and DC was offset using a pre-stimulus period (0.2 s) as the baseline. Peak latency was 
measured for each recognizable component by a computer cursor. Spatial temporal dipoles were calculated at each component. 

Data analysis: Peaks in the averaged signal were identified using the CTF DataEditor. The source location corresponding to each peak latency 
was estimated individually using the CTF DipoleFit. 

RESULTS 

Auditory evoked magnetic fields (AEF) were successfully recorded from all six subjects. The typical averaged waveforms are shown in Figure 1. 
In all six subjects, moving sound along elicited two clear responses (Figure 1, B): one response was identified at a latency of 100 ± 8.2 ms (Ml00); 
and the other response at 649± 14.2 ms (MM). There were magnetic activities after the Ml00, we called it long-term component (or magnetic 
residency). The contour maps of the first and the second responses indicated consistently two dipoles in the bilateral superior temporal cortices. 

Interestingly, when subjects were doing calculations, the moving sound could only elicited one clear response which was identified at a latency of 
107± 9.5 ms (M100). There were some magnetic activities before the M100 in 4 subjects (4/6); however, the morphology of those magnetic 
activities was variable. The long-term component, magnetic activities after the Ml00, were observed in all subjects; however, there were large inter- 
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individual differences. The contour maps of the first responses indicated consistently two dipoles. The most striking finding was a dent in the 
waveform at a latency of 600 ± 4.3 ms (Figure 1. A). 

A comparison of averaged waveforms between the pure moving sound and the moving sound combined with calculation task suggested that the 
attention task mainly affected the motion related response (MM). In 4 out of the 6 subjects, the long-term component elicited by the moving sound 
combined with calculation task was much stronger than that elicited by the pure moving sound. 

Though we used two moving directions in this 
Ddlt study, the effect of the moving direction on the 

magnetic response was not significant. 

DISCUSSION 

Our previous work using MEG has demonstrated 
that one response is only elicited by moving sound but 
not stationary sound. Dipole based source localization 
indicates that the right parietal cortex is involved in 
sound motion processing [Xiang, 2002]. This study has 
further confirmed our previous report; therefore, we 
consider that the right parietal cortex, in association 
with the left and right superior temporal cortex, forms a 
network to process sound motion information. This 
result is in agreement with previous report [Griffiths, 
2000]. Previous functional imaging work using positron 
emission tomography (PET) and functional magnetic 
resonance imaging (fMRI) has demonstrated a network 
of areas that are active during the perception of sound 
B movement. The network includes bilateral premotor 

areas and the right parietal cortex. The frontal activation 
includes both dorsal premotor activity in the region of 
the frontal eye fields and discrete ventral premotor 
activation in an area corresponding to primate areas for 
multimodal spatial analysis and motor planning. The 
right parietal activation includes both superior and 
inferior parietal cortex. 

To our knowledge, there is no report focusing on the 
effect of attention on the auditory motion related response. Since the auditory motion related response could be totally inhibited by a calculation 
task, we consider that the neuromagnetic response to auditory motion has the potential to be a novel index for evaluation of attention. The effect of 
attention on auditory motion has been studied using event-related potential (ERP). Stimuli moving in the attended direction elicits ERPs that are more 
negative than ERPs to stimuli moving in the unattended direction. This difference starts around 140 ms post stimulus onset for visual and around 120 
ms for auditory stimuli [Beer, 2004]. The ERP results strongly support our results: the attention task did not significantly affect the early response 
(Ml00), but significantly inhibited the later response (MM). It is not clear how and why the MM nearly disappeared in the stimulation of moving 
sound combined with calculation task. We consider that it would be very interesting to further confirm that the motion related response is a novel 
objective parameter for studying the cerebral mechanisms underlying attention in the auditory system. 
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Distinct Pathways Involved in Sound Localisation of Still and Oscillating Acoustic Stimuli: A Human 

fMRI/MEG Study 

Brunetti M. 1,2 , Belardinelli P. 1 ’ 2 ’ 3 ’ 4 , Del Gratta C. 1 ’ 2 ’ 3 , Pizzella V. 12,3 , Della Penna S. 12,3 , Ferretti A. 1,2,3 ? 
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institute of Advanced Biomedical Technologies, University of Chieti; department of Clinical Sciences and Biomedical Imaging, University of 
Chieti; 3 INFM- National Institute for the Physics of Matter, Coordinated group of Chieti; 4 ECONA (Interuniversity Center for the Cognitive 
Processing in Natural and Artificial Sistems) and Department of Psychology, University of Rome“La Sapienza”. 


This study is aimed at investigating the role of the caudal superior temporal gyrus in sound localization. Furthermore, on the basis of the results 
obtained in a previous study [1], we investigated if the crossing of the vertical auditory meridian determines a different activation with respect to the 
localization of sounds incoming from different locations in the same hemi-field, as a consequence of a “meridian effect”. Two protocols were used in 
both fMRI and MEG procedures: in the first one we used sounds coming from 5 locations (0° and 45° RIGHT ear, frontal direction, 45° and 0° LEFT 
ear). The experimental conditions were: MIXED (a complex sequence of sounds incoming from the five directions), LEFT (from 0° on the left), 
RIGHT (from 0° on the right) and FRONTAL (from 90°). In the second protocol, the LEFT, RIGHT and FRONTAL conditions were characterized by 
sequences of sounds alternatively oscillating between 2 locations separated by an angle of ± 20°. The MIXED condition was the same as in the first 
protocol. For both protocols, in MEG, the magnetic field showed, a bilateral activation over the right and left temporal regions, peaking at about 130 
ms (mean latency 136 ms, Nlm) after the stimulus presentation and a dipolar pattern over the right inferior parietal lobule, peaking at 180 ms. In 
fMRI, different activations were observed in the STG and the supramarginal gyrus across the four conditions, suggesting an influence of the 
“meridian effect”. Comparison between conditions revealed that the activation in the caudal STG was significantly larger in the MIXED condition. 
The results suggest an important role of the caudal portion of STG and influence of the “meridian effect” in sound localization. 

[1] Ferlazzo, et al. The Quarterly Journal of Experimental Psychology, 2002, 55A(3), 937-963. 


Short-term Habituation of Neuromagnetic Responses to a Short Click Train 

F.W. Carver 1 , M.C. Feeney 1 , J. Mitchell-Francis 1 , T. Holroyd 1 & R. Coppola 12 
*MEG Core Facility, NIMH, Bethesda, MD, USA. 2 Clinical Brain Disorders Branch, NIMH. 

Deficits in short-term habituation, or sensory gating, of responses to successive stimuli are observed in 
schizophrenia and other disorders. A paired click paradigm is used to study sensory gating, with the relative 
amplitudes of the P50 responses to each click used as a measure of habituation. Difficulties are inherent in 

40 

this measure because the P50 can be obscured by components such as the N1/P2 complex and the mid- ? 
latency gamma response. We present here an MEG experiment designed to produce a more stable measure | 3 ° 
of short-term habituation by employing a greater number of stimuli and by using time-frequency analysis to ~ 
distinguish the habituation properties of overlapping responses. Eight normal volunteers listened to 80 trains 
of five binaural clicks at a 250ms ISI, with each train separated by 4 to 6secs. Time-frequency analysis of the ,0 
resulting evoked fields from 68 temporal channels was performed using a Stockwe 11-transform. The relative 

-o 

change in power within each frequency band was determined by dividing post-stimulus power by an average 
of pre-stimulus activity. The figure shows power change across all subjects plotted on a thresholded log scale. Distinct gamma band responses occur 
after each click, with the first displaying double the amplitude of the remaining four. In the range from 6-16Hz, related to the N1/P2 complex, activity 
is below threshold after the second click. In addition, a large amplitude component at the stimulus frequency of 4Hz lasts the entire duration of the 
click train. These results demonstrate the utility of time-frequency analysis for distinguishing the habituation properties of evoked responses and will 
serve as a baseline for future studies with patient populations. 
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Cortical and Sub cortical processing of binaural pitch - an MEG study 

Maria Chait*, David Poeppel*, and Jonathan Simon ** 

*Neuroscience and Cognitive Science Program, Cognitive Neuroscience of Language Lab, Univ. Maryland, College Park. **Depts 

Electrical and Computer Engineering and Biology, Univ. Maryland, College Park 

Huggins dichotic pitch (HP) is an intriguing auditory phenomenon. It is generated by presenting a random noise signal to one ear, and the same noise 
with a n phase shift over a narrow frequency band to the other ear. The percept is that of a faint tonal object (corresponding to the center frequency of 
the phase-shifted band) embedded in noise. Here we compare the cortical auditory evoked responses to HP with those of tones embedded in noise. 
These perceptually similar but physically very different stimuli are interesting tools for the study of pitch processing in auditory cortex and it’s 
relationship to perception. The signals were 1500 ms long, consisting of 1000 ms interaurally correlated or uncorrelated white noise followed by 
either HP (with a center frequency of 200, 400, 600 or 1000 Hz) or a tone of those frequencies embedded in matched noise (TN). Auditory cortical 
responses were recorded using a 160 channel whole head MEG system (KIT, Kanazawa, Japan) while subjects performed a tone detection task. The 
electrophysiological response latencies provided qualitatively different information than the behavioral response times. Each participant showed a 
clear onset response at ~160ms post HP/TN onset, modulated by perceived tone and lateralized to the left, with a characteristic Ml00 spatial 
distribution. These response properties were similar for both kinds of stimuli, but significantly earlier (~30 ms difference in peak latency) for HP 
trials when the pitch-onset was preceded by correlated noise, and significantly later (~20ms) for HP trials relative to TN trials when preceded by 
uncorrelated noise. These data support brainstem cross-correlational models of binaural interaction, as well as cortical effects on the process of 
extraction of tonal objects from background noise. The significance of these findings to the interpretation of the Ml 00 auditory onset response and to 
models of cortical expansion of latency disparities is discussed. 

MC and DP are supported by NIH R01DC05660. 


Dynamic Cortical Activation During Phoneme Discrimination 

R.E. Frye, J. McGraw-Fisher, J. Liederman, A. Dale, E. Halgren 
MGH Martinos Imaging Center, Charlestown, MA, USA 

It is well known that categorization and discrimination of phonemes along a voice onset time (VOT) continuum is abnormal in persons with dyslexia. 
Neuroimaging studies implicate bitemporal dysfunction in dyslexic readers. The majority of this evidence is based on absolute activation measures 
over long time windows, thereby limiting the examination of the dynamic evolution of cortical activation in the left and right hemispheres. 
Neurophysiologic investigations of phonological processing in normal readers with magnetoencephalography (MEG) suggest early primary auditory 
cortical activation bilaterally with late lateralization to the left temporal area for linguistic, but not non-linguistic speech-like, stimuli. However, the 
dynamics of bitemporal activation in response to phonemes along a VOT continuum has not been studied in detail. A paradigm previously shown to 
identify temporally modulated dynamics of both within and between phoneme category discrimination was modified for use in the MEG 
environment. The VOT continuum contained eleven syllables divided into two phoneme regions and one boundary region. One of the phoneme 
regions was sufficiently large so as to allow division into prototypical and non-prototypical subregions. Two 170ms phonemes were binaurally 
presented sequentially with a 10ms separation of silence. A NeuroMag VectorView system, with 306 channels within an Imedco magnetically 
shielded room, recorded MEG fields from the entire scalp at 601 Hz. Sensitivity-normalized continuous current estimates of the MEG data produced 
statistical parametric maps of brain activation every 4ms for ten normal participants. In general, bilateral temporal-parietal activation was observed 
with early right hemisphere activation progressing to later left hemisphere activation. Matching phonemes categorized as more prototypic show 
greater and prolonged left hemisphere activation than phonemes categorized as boundary and non-prototypical phonemes. 
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Neuromagnetic correlates of auditory stream segregation 

A. Gutschalk, C. Micheyl, M. Scherg, A. Rupp, A.J. Oxenham 
University of Heidelberg, Germany & Massachusetts Institute of Technology, USA 


Alternating-tone sequences can be perceived as one coherent stream or segregated into two parallel streams, depending on frequency and inter¬ 
stimulus interval [1]. In this study, we examined whether frequency specific adaptation [2] of auditory evoked fields (AEF) reflects how tone 
sequences are perceptually organized. Fourteen listeners were tested with a 122-channel whole-head MEG system. Spatio-temporal dipole modeling 
was used to derive source waveforms for the middle- and long-latency AEFs. Stimuli were ABA_ tone triplets, where A and B represent different 
frequencies and _ indicates a silent inter-triplet interval. In experiment 1, B was fixed at 1000 Hz and A was varied from 1000 to 2000 Hz in 2 
semitone steps. The tone duration was fixed at 100 ms and the inter-tone-interval at 50 ms. Results show an increase in Plm and Nlm source strength 
corresponding to the B tones as the frequency separation between A and B was increased beyond 2 semitones. This increase in B-tone response 
magnitude mirrored the increase in perceptual segregation measured psychophysically. For earlier components (N19m and P30m), no systematic 
effects of A-B frequency separation were observed. In experiment 2, subjects listened to ABA_ sequences with an A-B separation of 4 or 6 
semitones, which produced an ambiguous time-varying percept, and they reported their percept (one or two streams) in real-time. Plm-Nlm 
magnitudes were slightly but significantly larger during times when subjects reported hearing segregation rather than integration. Overall these 
results can be explained by assuming that auditory stream segregation is related to stimulus specific adaptation. The dissociation of the Plm-Nlm 
and the earlier N19m-P30m indicates that the effect originates, at least in part, from the auditory cortex. Research supported by the DFG (GU 593/2- 

i). 

[1] Bregman AS et al. (2000) Perception & Psychophysics 62:626-636. 

[2] Butler RA( 1972) Electroenceph. clin. Neurophysiol. 33:277-282. 


Enhanced Response to a Mistuned Sound in Human Auditory Cortex 

H. Hiraumi, Y. Naito, T. Nagamine*, H. Fukuyama* J. Ito 
Department of Otolaryngology- Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Japan 
*Human Brain Research Center, Graduate School of Medicine, Kyoto University, Japan, 

Human auditory system has an ability to segregate a specific sound from background environmental sounds, but its neuronal mechanism is not 
understood completely. The comparison of response to a harmonic sound and a mistuned sound is one method to investigate its mechanism [1]. 
Fisteners to a harmonic sound recognize only one pitch, but when the frequency of one component is shifted to some degree, the mistuned tone is 
recognized separately. In the present study, the auditory evoked magnetic fields (AEFs) to a harmonic sound and a mistuned sound were recorded 
with a whole-head magnetoencephalography (MEG) system. The harmonic sound was composed of a 200 Hz tone and its 2 nd to 12 th harmonics, 
while the mistuned sound had a 696 Hz tone instead of a 600 Hz tone in the harmonic sound. These two sounds were presented binaurally one after 
the other with a constant interstimulus interval of 1 s. In all participants, reliable 100 ms responses were obtained, and their equivalent current dipoles 
(ECDs) were estimated. The strength of the ECDs evoked by the mistuned sounds was significantly larger than that of the ECDs evoked by the 
harmonic sounds, which suggests that some part of sound segregation may be achieved before the level of N 100m component. 

[1] Alain C, McDonald KF, Ostroff JM, Schneider B. 2001. Age-related changes in detecting a mistuned harmonic. J Acoust Soc Am 109,2211- 
2216. 
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Auditory evoked responses in human newborns recorded by MEG and EEG 

Manuela Holst, Hari Eswaran, Curtis L Lowery, Pam Murphy, Hubert Preissl 
SARA Research Center, Department of Ob/Gyn, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA 

Based on our previous fetal studies, we investigated auditory evoked responses (AER) in newborns by a combined MEG-EEG study. 19 healthy 
neonates (38 recordings) with conception ages ranging from 38 to 44 week were studied. 17 had both MEG and EEG recorded simultaneously 
whereas 2 had MEG only. MEG was measured with the SARA (Squid Array for Reproductive Assessment) device and EEG with Ag/AgCl 
electrodes. For the newborn investigation a custom designed cradle was attached to the MEG system to allow the measurements in reclining position. 
We performed an odd-ball study on each ear separately using frequent tones of 500 Hz and infrequent tones of 700 and 1 kHz. The interstimulus 
interval was 2s (+/- 0.5s randomization). Interfering heart signals were removed offline. Signals were averaged over Is epochs (0.2s pre-stimulus and 
0.8 post-stimulus) around frequent tones. 31 of 38 MEG recordings showed evoked responses and 18 of 32 EEG recordings had identifiable signals. 
For peak latencies analysis we chose the largest peak. MEG showed an average response latency of 251ms for left ear and 267ms for right ear. 
Electrical AERs showed an average latency of 253 ms for the left ear and 252 ms for the right ear. In conclusion, the SARA device designed for fetal 
studies, could be used for newborn recordings. The combination of EEG and MEG allows comparison of results from both techniques in the newborn 
period thus helping in better quantification of AER latencies in the fetal period. 

Acknowledgement: Funded by grants from NINDS/NIH and NIBIB/NIH. 


Gamma activity revealed in humans during working memory retention of virtual pitches 

O. Jensen, M. Schulte, H. van Dijk 

F.C. Donders Centre for Cognitive Neuroimaging, Nijmegen, the Netherlands 

It has been suggested that neuronal oscillatory synchrony in the gamma band (30-80 Hz) serves to ‘bind’ features of objects in order to form 
representations. In addition, gamma activity has been proposed to play a role in sustaining neuronal activity during working memory retention. Our 
aim was to investigate if neuronal synchrony in the gamma band plays a role in maintaining complex auditory representations. In a delay-match-to- 
sample task subjects were supposed to maintain pure tones, complex tones and virtual pitches. The virtual pitch was composed of only the harmonics 
of the fundamental pitch (e.g. 1000, 1250, 1500 Hz). Most subjects will perceive the fundamental (250 Hz) as a virtual pitch thus requiring ‘binding’ 
of the neuronal representations of the harmonics. We recorded the brain activity from 12 subjects using a whole-head MEG system with 151 sensors 
(CTF). Using a wavelet technique we analyzed the oscillatory signals (planar gradient). When comparing virtual pitches to non-virtual pitches we 
found induced oscillatory activity constrained to the gamma band (28-32 Hz) during memory maintenance over left frontal areas. The gamma activity 
was not phase-locked to the stimuli. Our findings contradict reports questioning if induced gamma activity be measured by MEG. We propose that the 
induced gamma activity reflects active maintenance of complex auditory representations requiring binding. 
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Neuromagnetic Evaluation of Binaural Unmasking in Normal Subjects and Patients with Comatose 

Survivors After Severe Head Injury 

A. Kanno, N. Nakasato, T. Kawase*, T. Sasaki, H. Nagamine, S. Fujiwara 
Kohnan Hospital Ryogo Center and *Department of Otolaryngology, Head and Neck Surgery, Tohoku University, Sendai, Japan 

Background: Binaural unmasking refers to the improvement in intelligibility under conditions of masking when a tone is presented out of phase 
rather than in phase. Only subjective methods have been used previously to test binaural masking. Methods: Binaural unmasking was evaluated using 
auditory evoked MEG in 8 healthy subjects and 8 comatose survivors after severe head injury in which bihemispherical auditory responses were 
observed for each left and right monaural stimulus. Peak latency and amplitude of the Nlm to tone bursts of 250 Hz were measured under S 0 N 0 (both 
stimulus and masker in phase) and S^No (stimulus and masker out of phase) conditions. The level of tone bursts was swept by 5 or 10 dB steps, from 
the level of 20 dB SL (sensation level compared to the psychophysical threshold under the S 0 N 0 condition) in normal subjects, and from 80 dB SPL 
(sound pressure level) in patients, until no significant response could be observed. Identical background noise was presented to both ears 
continuously at 50 dB SPL. Results: In all the normal subjects, Nlm to stimuli at or above the psychophysical threshold were found bilaterally except 
in one who had only right hemispheric Nlm. Nlm for the S^No stimulus had larger amplitude and shorter latency than that for the S 0 N 0 stimulus in 
each hemisphere and at each sound level. Neuromagnetic binaural unmasking was greatest around the threshold level, corresponding to 
psychophysical binaural unmasking; became smaller with greater stimuli, indicating the suprathreshold unmasking effect; and disappeared at around 
15-20 dB above the threshold. Neuromagnetic binaural unmasking was also found in 6 of the 8 patients. The two other patients showed little 
difference between S 0 N 0 and S^N 0 conditions. Conclusions: Psychophysical binaural unmasking can be quantitatively evaluated by MEG in the 
auditory cortex level of the bilateral hemispheres. 


Language MEG Study of Phonological Contrasts 

Nina Kazanina, Colin Phillips 
University of Maryland at College Park, USA 

This study investigates the availability of phonological information to auditory cortex using sounds drawn from a Russian /da/-/ta/ continuum, which 
shows a pure voicing contrast and a category boundary that is acoustically very different from the corresponding boundary in English. Brain activity 
was recorded using a 160-channel MEG system while 10 Russian speakers listened passively to syllables from the /da/-/ta/ continuum presented in an 
oddball paradigm (7:1 standard:deviant ratio). Multiple tokens of each category (voiced: from -40 ms to -24 ms; voiceless: from -8ms to +8ms) were 
presented, each category being standard in one block and deviant in the other one. A magnetic mismatch field (MMF) was elicited for each of the 
categories in the block when it was the infrequent deviant sound compared to the same category as standard. Due to the locus of the Russian category 
boundary at around -15ms VOT, compared to the typical English boundary of +30ms VOT, this study suggests that mismatch responses elicited by 
voicing contrasts in earlier studies on English ([1]) cannot be uniquely due to the fact that the English category boundary corresponds to a low-level 
timing threshold based on neuronal refractory periods, and demonstrated in non-human mammals [3]. A further test of the contribution of abstract 
phoneme representations to the mismatch response comes from a study of Korean, a language that employs the same phonetic voicing contrast as 
Russian, but does not use the contrast phonemically, to encode lexical contrasts. 

[1] Sharma, A., R, Dorman, M.F. 1999. Cortical auditory evoked potential correlates of categorical perception of voice onset time. JASA 106, 1078- 
1083. 

[3] Steinschneider, M., Schroeder, C.E., Arezzo, J., Vaughan, H.G. 1994. Speech-evoked activity in primary auditory cortex: Effects of voice onset 
time. Electroencephalography and Clinical Neurophysiology, 92, 30-43. 
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Neuromagnetic responses to native and non-native speech sounds in Japanese speakers 

S. Koyama, A. Gunji, R. Akahane-Yamda*, R. Kakigi 
National Institute for Physiological Sciences, Japan; *ATR Human Information Laboratories 

In our previous study we recorded MMNm, a magnetic counter part of Mismatch Negativity, from Japanese speakers using /ra/, /la/ and /wa/ sounds 
(Koyama et al., 2001 presented at Society for Neuroscience). The /w/ category locates adjacent to /l/ and /r/ sounds in F2-F3 space for English. Only 
/w/ occurs in the Japanese phonetic system. Using a passive oddball paradigm we recorded MMNm supposed to be associated with detecting change 
in a sound sequence using memory traces developed by repetitive standard sounds. We found that the MMNm amplitude for /wa/ deviants embedded 
in either /la/ or /ra/ standards was smaller than that for either /ra/ and /la/ deviants embedded in /wa/ standards. We interpreted this result that native 
speech sounds /wa/ create more enhanced short-term memory traces for standard sounds than non-native speech sounds /ra/ and /la/. To extend this 
finding we recorded MMNm from Japanese speakers using /tha/ (voiced) and /za/ in the present study. There is no interdental fricative in the 
Japanese phonetic system and /tha/ (voiced) is often perceived as /za/ (voiced alveolar fricative, present in the Japanese phonetic system) by Japanese 
speakers. Subjects were 10 native speakers of Japanese (right handed, four females and six males, aged 21-36) and stimuli (/tha/, /za/) were male 
speech sounds spoken by a native American English speaker with a duration of 200 ms (onset-onset stimulus interval, 600ms; binaural presentation). 
A clear MMNm with a peak latency of 250-300 ms was bilaterally observed only for /tha/ deviants but not for /za/ deviants. Thus the present findings 
provide further evidence that the way in which standard sounds stored as memory traces, differs between native and non-native speech sounds and 
native speech sounds are able to create more robust memory traces than non-native speech sounds. 


Neuromagnetic activation mirrors auditory perceptual asymmetry 

A. Kult 1,3 , D. Pressnitzer 2 , S. Supek 3 , A. Rupp 1 
department of Neurology, University of Heidelberg, Germany 

2 Ircam-CNRS, Paris, France 

3Department of Physics, Faculty of Science, University of Zagreb, Croatia 

Human listeners are highly sensitive to temporal asymmetry of sounds [1]. When a damped exponential with a 4-ms half-life time (HLT) is used to 
modulate a sinusoid, it reduces the sound quality typically associated with the carrier. When the modulator is reversed, producing a "ramped" sound, 
the sound quality of the carrier is more salient. Here we studied the relation of the perceived properties of asymmetry and the neuromagnetic 
representation in the auditory cortex. Ramped and damped sounds with five different HLTs, i.e. 0.5, 1,4, 16 and 32 ms, were presented. Sinusoidal 
and noise carrier were used. Auditory evoked fields (AEFs) were acquired with a whole-head MEG system in 10 normal hearing subjects. Perceptual 
data were obtained with a paired-comparison technique to derive a relative scale of the salience of the carrier quality for each stimulus. Spatio- 
temporal dipole modeling showed that envelope coding was reflected by the P30m activity whereas the perceptual asymmetry correlated with later 
P50m and NIOOm activity. Furthermore, the amount of the neurophysiological asymmetry paralleled the perceptual scales, with a maximum for 
stimuli with a 4-ms HLT. The asymmetry had been observed in single neurons [3] but our results represent the first neurophysiological ensamble 
correlate of the phenomenon. 

[1] Patterson RD, 1994a, J Acoust Soc Am, 96: 1409-1418 

[2] Kult A et al., 2003, Neuroimage CD-ROM, 19 (2) 

[3] Pressnitzer D et al., 2000, Hearing Research, 149, 155-166 
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Trial-by-trial habituation in the primary auditory cortex 

A.C. Leuthold and T.W. Wilson 

Brain Sciences Center, Veterans Affairs Medical Center, Minneapolis, MN, USA 
Departments of Neuroscience and Psychology, University of Minnesota-TC, USA 

To better characterize the effect of habituation in the auditory cortex, MEG recordings are made during many short sessions over several weeks on 
the same subject. This approach allows the averaging of epochs from different sessions but of similar time placement within each session. To obtain 
reproducible head placement across the sessions, a rigid foam helmet was constructed to fill the space between the subject’s head and the sensor. The 
stimulus is a 2 kHz tone of 50 ms duration, including a 20 ms taper at the leading and trailing edges, presented with a random inter-trial interval from 
1 to 2 seconds. The project will be completed with 100 sessions of 100 trials, allowing sufficient signal/noise to view an averaged epoch from as 
little as one trial from each session. To date, 60 sessions have been acquired and 30 have been analyzed. Reduction in amplitude of the primary 
response is seen to fall off significantly within the first 20 trials and continue to decrease throughout the 100 trial session. Results of sufficient detail 
to see changes within the first 10 trials will be available by the time of the conference. 

This work was supported by the MIND institute. 


Preattentive memory-based comparison of pitch: An MEG study of the Mismatch negativity to pitch 

changes 

B. Maess 1 , T. Jacobsen 2 , E. Schroger 2 

1 Max-Planck-Institute for Human Cognitive and Brain Sciences, Leipzig, Germany 
2 BioCog - Institute of Experimental Psychology, University of Leipzig, Leipzig, Germany 

Preattentive memory-based detection of changes in pitch is reflected by the Mismatch Negativity (MMN) [1]. Here, the temporal course and neural 
substrate of genuine MMN and contributions from neural refractory processes was investigated using MEG. The MMN to frequency changes was 
investigated using ten sinusoidal tones of 50 ms length and different frequencies. Stimuli were presented in three blocked conditions: 


Descending deviant 

0550 Hz standard (90%) 

0500 Hz deviant (10%) 

Ascending deviant 

1072 Hz standard (90%) 

1179 Hz deviant (10%) 

Control 

500, 550, 606, 666, 732, 805, 886, 974, 1072, 1179 Hz (10% each) 


18 volunteers heard the stimuli (73 dB SPL, 500 ms SOA) while watching a silenced movie. There were 1500 trials per block, and six blocks per 
session in four sessions. MEG data was artifact rejected and filtered before averaging. Individual volume conductors were calculated based on 
individual Tl-MR data. Two mismatch differences were analyzed: deviant-standard (D-S) and deviant-control (D-C). MEG data revealed results of 
the previous EEG study as the respones to D-S were larger and earlier. A pair of moving dipoles was fitted to magnetic field data within the time 
intervals 90-120 ms and 170-220 ms. Time intervals were chosen according to mean global field power signal. Mean location of the dipoles were 
within the vicinity of the auditory cortex. However, D-C locations were more lateral and more posterior compared to the D-S locations. Mismatch 
activity was mainly produced by temporal sources. A dissociation of contributions from memory-based comparison and neural refractoriness was 
demonstrated. 

[1] Jacobsen, T. & Schroger, E. (2001). Psychophysiology, 38(4), 723-727. 
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Dipole Modeling of the Maturation of the Auditory Evoked Field 

K. Maharajh, D. Rojas, R Teale, M.R. Kleman, M. Reite 
University of Colorado, Health Sciences Center, USA 

The developmental characteristics of the auditory evoked field were studied in a large cross sectional age sample. Previous studies, primarily EEG, 
have suggested the existence of multiple components that vary both spatially and temporally as a function of age [1,2]. Data was collected on 53 
subjects whose age ranged from 9 to 54 years old using a 37 channel Magnes I gradiometer MEG system. To determine the appropriate number of 
sources that may be involved, an orthogonal-triangular (QR) matrix decomposition was performed. Similar techniques, such as the SVD, have been 
used as an initial modeling guess for the number of dipoles to be used [3]. Preliminary results indicate an inverse relationship between age and the 
number of sources, which asymptotically approaches one source per hemisphere at adulthood. 

[1] Takeshita, K., Nagamine, T., Thuy, D.H.D., Satow, T., Matsuhashi, M., Yamamoto, J.,Takayama, M., Fujiwara, N., Shibasaki, H. Maturational 
change of parallel auditory processing in school-aged children revealed by simultaneous recording of magnetic and electric cortical responses. 
Clinical Neurophysiology, 113(2002) 1470-1484. 

[2] Naatanen, R., Picton, T. The N1 wave of the human electric and magnetic response to sound: A review and an analysis of the component 
structure. Psychophysiology, 24:4(1987) 375-425. 

[3] Huang, M., Aine, C.J., Supelc, S., Best, E., Ranken, D., Flynn, E.R. Multi-start downhill simplex method for spatio-temporal source localization 
in magnetoencephalography. Electroencephalography and clinical Neurophysiology, 108 (1998) 32^14. 


The Deutsch illusion: neuromagnetic correlates of stream segregation by pitch and space 

K. Mathiak, H. Menning, I. Hertrich, H. Ackermann 
University of Tubingen, Germany 

Deutsch [1] described that streams of dichotic tones with high and low pitch changing from left to right might be reduced perceptionally such that the 
high tones are perceived at the right ear and low tones at the left ear. Dichotic stimulus presentation in whole-head magnetoencephalography can be 
used to functionally segregate both auditory cortices [2]. We conducted three experiments applying dichotic stimuli changing sides: a) slow pitch 
alternations (1.2 Hz) eliciting no illusion, b) fast alternations (4 Hz) of tone pairs and c) of melodies, both characterized by the Deutsch illusion. 
Mismatch fields were recorded over both hemispheres to pitch deviants at either side. First, two well known effects emerged: higher responses of 
the hemisphere contralateral to the rare event reflected functional segregation and faster right hemispheric responses reflected higher proficiency of 
the right hemisphere to extract pitch [3]. Second, moreover, for the illusionary conditions, contralaterality reflected now the illusionary side and the 
illusion effect for melodies emerged at the left hemisphere only. Illusory spatial perception due to primitive (pitch) and pattern-based (melody) 
stream segregation [4] is reflected at the level of the supratemporal plane. Processing properties, thus, of the auditory areas might underlie the 
Deutsch illusion. We suggest that a trade-off between predominant spectral pitch representation at the left and temporal pitch representation at the 
right auditory cortex [5] elicits the perceptual shift to the right ear of high and to the left of low tones. 

Acknowledgments: Supported by DaimlerChrysler AG, DARPA, and DFG SFB 550/BI. 

[1] Deutsch (1974) Nature, 251:307. [2] Mathiak et al. (2000) BrainResCognBrainRes, 10:125. [3] Mathiak et al. (2002) Neuropsychologia, 40:585. 

[4] Bregman (1990) Auditory scene analysis. [5] Zatorre et al. (2002) TICS, 6:37. 
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Auditory differential processing persist during stage 2 sleep in adults: simultaneous recordings of 

electroencephalography and magnetoencephalography 

J. Matsubayashi, M. Imai, T. Nagamine, H. Shibasaki, H. Fukuyama 
Human Brain Research Center, Kyoto University Graduate School of Medicine, Kyoto, Japan 

Few studies have succeeded in recording the mismatch negativity (MMN) from an adult during stage 2-4 sleep. However, our group previously found 
that the mismatch field (MMF), the magnetic counterpart of the MMN, could be observed during stage 2 sleep in adults in certain conditions. The 
aim of the present study is to show that auditory differential processing persists during stage 2 sleep in adults by magnetoencephalographic recording 
in the standard experimental condition under which electroencephalography fails to detect MMN. We carried out simultaneous recordings of 
electroencephalography and magnetoencephalography from 19 normal volunteers using whole-head 122-channel planar gradiometers. 1000 Hz pure 
tone burst of 80% probability (i.e. standard stimulus) and 1200 Hz of 20% probability (i.e. deviant stimulus) were repeatedly presented to the subject 
binaurally with stimulus onset asynchrony of 498 msec. Both wakefulness and sleep sessions were carried out in a magnetically shielded room while 
the subject sat in a chair watching a silent film or taking a nap, respectively. The data of 11 subjects who had a minimum of 100 responses for 
averages to deviant stimuli were brought to further analysis. In wakefulness session, eight subjects showed both MMN and MMF and two subjects 
disclosed MMF during the latency range of 50-300 msec. In sleep session, although MMN was observed only in one subject, MMF was seen in the 
remaining 10 subjects. This result shows that auditory differential processing persists during stage 2 sleep in adults. 


Training induced plasticity in auditory cortex 

S. S. Nagarajan, D.J. McGonigle, K. Sekihara and I. Hairston 
Biomagnetic Imaging Laboratory, University of California, San Francisco, USA 

The goal of this project is to characterize spatiotemporal plasticity in representations of successively occurring stimuli following longitudinal 
behavioral training. Normal adult subjects were tested before and after training in their ability to discriminate the modulation-rate of trains of four 
tone-pips, at base modulation rates of 5 and 13 Hz. Training involved 10 days (900 trials/day) of practicing rate-discrimination of tone-pip trains. 
Subjects were trained at a base rate of 5 Hz or 13 Hz. Subjects underwent three Magnetic Source Imaging (MSI) sessions (2 pre- and 1 post-training) 
during which auditory cortical responses were measured to passive listening at these modulation-rates. Measurements were obtained from both 
hemispheres using two 37-channel biomagnetometer arrays (4D Neuroimaging Inc, San Diego). Pre-training and post-training MSI data were 
reconstructed using beamfomers and showed reliable responses from auditory cortex in both hemispheres. Subjects trained at a base-rate of 5 Hz with 
simple tone trains improved in their abilities with training. Improvements were found to be specific to the training base-rate. MSI data revealed a 
right-hemisphere specific enhancement in the M200 response of the first tone-pip in a train and to the auditory cortical Ml 00 response of the second 
tone-pip. This enhancement was restricted to trains of simple tone-pips at 5 Hz and no differences were found in responses to trains of tone-pips at 13 
Hz in either hemisphere consistent with the learning. Subjects trained at 13 Hz did not exhibit learning. Surprisingly, these subjects exhibited an 
Ml00 enhancement in responses in both hemispheres to trained and untrained stimuli, suggesting active reorganization of auditory cortex in these 
subjects not necessarily related to the learning. These studies indicate that reliable training induced auditory cortical plasticity can be measured using 
MSI, and that multiple mechanisms are involved in the manifestation of such plasticity. 
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Temporal resolution for bone-conducted ultrasonic perception assessed by mismatch field (MMF) 

S. Nakagawal, R. Takegata2, M. Tonoikel 

1 Institute for Human Science and Biomedical Engineering, National Institute of Advanced Industrial Science and Technology (AIST), 

Japan; 

2 Department of Psychology, Helsinki University, Finland 

Recent studies have suggested bone-conducted ultrasonic hearing aid (BCU hearing aid), which transmits amplitude-modulated ultrasound by bone- 
conduction, is able to be developed for the profoundly deaf. In order to optimize the BCU hearing aid, characteristics of BCU perception should be 
more specified. In this study, the temporal resolution of BCU perception is assessed by mismatch field (MMF). MMFs were recorded in the 
following conditions: (a) 1 kHz air-conducted tone burst (Air), (b) 30 kHz BCU tone burst (BCU), (c) 30 kHz BCU amplitude modulated by 1 kHz 
tone burst (AM-BCU). Each condition consisted of one standard stimulus (75 ms duration, 85%) and three types of deviant stimuli (52.5, 37.5, and 
22.5 ms durations, 5 % each). They were presented at random order, and averaged more than 120 times for each. Stimulus onset asynchrony (SOA) 
was set at 300 ms. Equivalent current dipoles (ECDs) were estimated at MMF peaks in the both temporal regions. ECDs moments were 
significantly different among three kinds of stimuli (Air: BCU: AC-BCU = 1.0: 0.91: 0.53). ECD for BCU and AC-BCU locates more anterior than 
that of Air. Since MMF is generally more related to perceptual properties of stimuli than physical, the current results show the difference between 
air-conducted sounds and BCU in perceptual attribute rather than in the pathway. It is also indicated that the BCU hearing aid has sufficient temporal 
resolution. 


Dipole Source Modeling of the AEF in Pediatric Environments: 

Effects of Head Size, SNR and Sensor Coverage 

E.W. Pang, L. Bakhtazad, W. Gaetz, D. Cheyne 
Hospital for Sick Children, Toronto, Ontario, Canada 

The auditory Ml00 is known to be generated by bilateral, simultaneously active, temporal lobe sources. The most accurate method for localizing this 
response is still under discussion since 2-dipole ECD solutions sometimes fail. Two strategies, both based on the assumption that auditory sources 
are spatially separate and do not interact, include collecting data from a whole-head MEG and computing dipole solutions based on reduced sensor 
coverage of either contralateral hemisphere or a smaller subset over the temporal area. However, we have shown previously [1] that single dipole fits 
with reduced sensor coverage resulted in positional errors along the medial-lateral axis and this was exacerbated in children. We suggested that these 
poorer fits were due to either smaller head sizes (resulting in greater source proximity and likelihood of interaction), or, poorer SNR due to greater 
distances from the head to the MEG sensors. In order to test these hypotheses, we simulated AEFs manipulating signal strength (high vs. low), head 
size (adult vs. child diameters), and level of background noise (uncorrelated Gaussian vs. correlated brain noise) and compared the effects of sensor 
coverage using whole-head (150 sensors), half-head (70) or a selected temporal cluster (40) on the source localizations. Our findings confirm that 
dipole fits using restricted sensor coverage can result in significant positional errors (>lcm) and these errors are exacerbated in the child-sized head. 
Furthermore, we demonstrate that low SNR in the presence of correlated brain noise produces the largest spatial errors, even if sources in both 
temporal lobes are modeled. 

Supported by grants from the Canadian Institutes of Health Research and NSERC of Canada. 

[1] Pang, E.W., Gaetz, W., Otsubo, H., Chuang, S., Cheyne, D. 2003. Localization of auditory N1 in children using MEG: Source modeling issues. 
Int. J. Psychophysiol. 51: 27-35. 
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Development of the 40-Hz steady state auditory evoked magnetic field 

D.C. Rojas, R Teale, M. Ramos, T.L. Benkers, J.R Carlson, M.L. Reite 

Department of Psychiatry 

University of Colorado Health Sciences Center, Denver, USA 

Adult subjects exhibit typical auditory 40Hz steady state evoked field responses generated by primary auditory cortex; the developmental course of 
this response is unclear. We studied 51 healthy subjects ranging in age from 5 to 52 years. Stimuli were 500 ms duration click trains with 25 ms 
between clicks and an inter-train interval of 1500 ms. Stimuli were monaurally delivered to each ear in separate blocks. Contralateral magnetic 
responses to 200 trials were recorded with a 4DNeuroimaging Magnes I, 37-channel biomagnetometer. Responses were averaged and digitally 
bandpassed filtered between 35-45 Hz offline. The ratio of the root mean square (RMS) field amplitude from 50 to 500 ms post stimulus onset to the 
RMS of the pre-stimulus interval was computed on all subjects. This 40 Hz steady state signal-to-noise ratio (SNR) measure was significantly 
correlated with subject age. No significant SNR differences were observed between left and right hemisphere recordings in this sample. Anterior- 
posterior hemispheric asymmetry of the sources may also increase with age. The RMS findings are consistent with previous EEG studies of steady 
state responses in children showing large age-related changes in the 40-Hz SSR. 


Auditory Cortex Dysfunction And Plastic Changes In Tinnitus: A MEG Study 

M. Santiuste (1), B. Bathal (2), M.C. Iglesias (2), L.F. Quesney (1)(3), C. Amo (1), A. Fernandez (1), F. Maestu (1), J. Poch (2), T. 

Ortiz (1) 

(1) Centro de Magnetoencefalografia. Universidad Complutense de Madrid. Espana. 

(2) Servicio de Otorrinolaringologia. Hospital Clinico Universitario San Carlos. Madrid. Espana. 

(3) McGill University, Dept. Neurology and Neurosugery. Montreal, Canada. 

Tinnitus is a distressing symptom affecting both normal and hearing impaired people. Our aim was to assess auditory cortex reorganization in tinnitus 
patients. We assumed tonotopicity in primary auditory cortex of normal hearing people, as reported elsewhere. We studied magnetic auditory evoked 
response (AER) by applying blocks of tone bursts (120 stimuli each) at 500, 1000, 2000, 4000 and 8000 Hz in 20 patients with tinnitus (mean age 47 
years) and in 10 normal-hearing controls (mean age 34 years). All subjects had normal BAEP’s and MRI findings. Mean tinnitus frequency was: 
8000 Hz (56% patients), 3000-4000 Hz (17% patients), 1000-2000 Hz (5% patients), 500 Hz or under (22% patients). Both ears were stimulated 
monaurally with simultaneous contralateral white noise. Recordings were obtained from both hemispheres with a 148-channel magnetometer. All 120 
stimuli for each block were averaged and the magnetic response occurring 100 milliseconds after stimulus onset (Ml00) was analyzed. Contrary to 
our expectations, we did not find a tonotopic organization of the auditory primary cortex and frequency encoding occurs as a cluster rather than 
linearly. An absence of magnetic AER in primary auditory cortex was documented in 18/20 patients (90%) suffering from tinnitus. Anatomical 
displacement of the auditory cortex was seen in 7/18 patients (39%). Five of these patients had abnormal magnetic AER in both auditory cortices 
whereas in 11 patients without auditory cortex displacement only 1 had bilateral affectation. Our findings demonstrate the existence of a primary 
auditory cortex abnormality detected by MEG in 90% of patients with tinnitus. Plastic reorganization of Heschl’s area was seen almost exclusively in 
patients with bilateral auditory cortex dysfunction. We believe that MEG should be used in the evaluation of patients with tinnitus and our 
observations might be instrumental in better understanding this condition. 
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Tonotopy, tonochrony and pitch mapping in Heschl’s gyrus of musicians 

P. Schneider 1 , M.Scherg 1 , H.G. Dosch 2 , H.J. Specht 3 , and A. Rupp 1 
Section of Biomagnetism, Department of Neurology, University Hospital Heidelberg; 

"Institute of Theoretical Physics, University of Heidelberg; Physical Institute, University of Heidelberg 

The Heschl’s gyms (HG) of musicians showed stmctural and functional enlargement of gray matter volume and the underlying auditory evoked 
activity [1]. Here, we investigate, if the tonotopic organization of the primary P30m activity (i) is preserved for the subsequent P50m response, (ii) is 
linked to the frequency dependence of peak latency (here referred to as ‘tonochronicity’) and (iii) depends on musical aptitude [1]. Using a 
Neuromag-122 whole-head MEG system, we recorded the AEFs to harmonic complex sounds with large parametric range (fundamental frequency 
range 100 - 5,600 Hz, complete and incomplete wideband of harmonics) in 38 musicians and 17 non-musicians. Approximately 300 cortical 
responses were averaged for each condition using the BESA program (MEGIS Software GmbH, Grafelfmg). The source activity was modelled with 
one equivalent dipole in each hemisphere and fitted to the time intervals around the P30m and P50m peaks. Localization and peak latency of the 
P30m activity revealed a strong logarithmic tonotopic and tonochronic organization in both left and right PAC (2.3 mm/octave; lower frequencies 
more lateral; 1.9 ms/octave, lower frequencies later). The P50m activity showed a ‘fundamental pitch mapping’ in the lateral portion of the left HG 
and a ‘spectral pitch mapping’ in the lateral portion of the right HG. Both, latency and localization exhibited a larger extension in musicians as 
compared to non-musicians (factor 1.7±0.2). Our neurophysiological data corroborate the cochleotopic organization of PAC [2]. The subsequent 
P50m response, which is generated in the pitch-sensitive lateral belt areas of HG, demonstrated a use-dependent ‘pitch mapping’ with respect to both 
localization and peak latency. Thus, our results support and extend previous findings about activity [1], tonotopicity [2,3] and asymmetry [4] of the 
human auditory cortex. [1] Schneider P. et al. Nat. Neurosci 5 , 688-694 (2002). [2] Romani GL, Williamson SJ and Kaufman L. Science 216 , 
1339-1340 (1982). [3] Formisano E. et al. Neuron 40 , 859-869 (2003). [4] Zatorre, R and Belin, P. Cereb. Cortex 11 , 946-953 (2001). 


Integration of near and far, auditory and visual stimuli 

J.M. Stephen 1 , D. Ranken 3 , C.J. Aine 1,2 , B. Hart 1 , J. Adair 1,2 , J. Knoefel 1,2 
University of New Mexico and New Mexico VA Healthcare System, Albuquerque, New Mexico, USA, Los Alamos National 

Laboratory, New Mexico, USA 

Based on monkey studies, Schroeder and Foxe [1] hypothesized that different cortical areas are responsible for integrating auditory and visual stimuli 
from near and far sources. Based on their work, we performed a magnetoencephalography study to determine whether multisensory responses were 
differentially activated in auditory association (AA) and superior temporal poly sensory (STP) areas for simulated near/far auditory (A) and visual (V) 
stimuli. The near and far A stimuli were simple loud and quiet tones. The near and far visual stimuli were simple soccer balls presented in a 
perspective drawing of a soccer field. Near and far A, V, and AV conditions were presented to the subjects. The V stimuli preceded the A stimuli by 5 
and 50 ms for the AV near and AV far conditions, respectively. The MEG data were modeled using the cortical start spatio-temporal modeling 
algorithm, which is an automated multi-dipole modeling technique [2]. Preliminary results show differential responses to near and far stimuli in both 
AA cortex and occipital cortex with a larger response to AV Near stimuli seen in AA and a larger response to AV Far stimuli seen in visual areas. The 
AV responses are distinct from the responses to A near and V far stimuli in the corresponding association cortices, respectively. These results help 
confirm a differential response to near and far AV stimuli. This study was supported in part by UNM RAC #C-2230-RAC, NIH COBRE 5-P20- 
RR15636-02, and MIND Institute grants. 

[1] Schroeder, C.E., Foxe, J.J. 2002. The timing and laminar profile of converging inputs to multisensory areas of the macaque neocortex. Cogn Brain 
Res 14, 187-198. 

[2] Ranken, D.M., et al. 2002. MEG/EEG Forward and inverse modeling using MRIVIEW. Proceedings of Biomag 2002, 785-787. 
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MEG responses form the frontal regions using a whole head SQUID system 

F. Takeuchi, K. Kamada, S. Kuriki, H. Kawaguchi 

Research Institute for Electronic Science, Hokkaido University, JAPAN,University of Tokyo Hospital, JAPAN,Graduate school of 

Medicine, Hokkaido University, JAPAN 

How do we measure MEG responses from the frontal regions precisely and stably using a whole head SQUID system? One of our solutions is to 
make subjects to lie prone, and to detect the responses from the frontal regions using the detection coils arranged at the back of the SQUID dewar. 
MEG signals were measured using a 204-channel SQUID system (Neuromag[l]) from seven Japanese subjects (4 females and 3 males, range of 20- 
21 years) lying prone on a bed. We used speech sound stimuli consisting of 30 words (Japanese) of concrete noun and tone stimuli that had 
modulated envelopes of the speech sound stimuli. These stimuli with a duration of 0.43±0.12 s and a intensity of 60-75 dB SPL were presented in a 
random order with random interstimulus intervals of 4-5 s. Subjects were instructed to generate internally a verb related to the presented word and to 
lift their left index finger when a speech sound stimulus was presented, and just to lift their left middle finger when a tone stimulus was presented. 
MEG responses for the speech stimuli and tone stimuli were averaged with respect to the onset of the stimuli. The averaged MEG responses were 
filtered to 0.1-40Hz, and the dc offset was removed using a time-average during a 1 s period before the stimulus presentation. Arrow maps of the 
MEG responses in a latency period of 0.16-0.30 s indicated neural activities at the right frontal region in four out of seven subjects for speech and 
tone stimulation. These activities might be related to the acoustic information processing or the task switching whether or not to generate a verb. 

[ 1 ]http://www.neuromag.com/ 


MEG Steady State Response to Broadband Sounds 

Y. Wang 1 , N. Ahmar 2 , J. Xiang 2 , D. Poeppel 1 ’ 3 , J.Z. Simon 2,3 

department of Linguistics, department of Electrical & Computer Engineerig, department of Biology, University of Maryland, 

U.S.A. 

The relevance of steady state response (SSR) in neuronal activity has been explored in contexts ranging from single cell recording to EEG and MEG. 
Narrowband stimuli allow straightforward analysis methods, but most behaviorally relevant auditory stimuli are broadband, e.g speech. We used 
presented 12 different auditory stimuli at at a variety of modulation rates and bandwidths, and analyzed the SSR as a function of changing bandwidth. 
A whole-head MEG, 160-channel, axial gradiometer system (KIT, Kanazawa, Japan) was used to acquire auditory evoked responses. The responses at 
every channel, in magnitude and phase, were determined. We find that the spatial distribution of strongly responding channels does not change 
significantly as a function of bandwidth, but the magnitude and phase of those responses changes consistently and significantly. In particular, 
magnitude decreases with increasing bandwidth. 

This work is supported by NIH grant number DC 05660 to DR 
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Altered representation of the auditory cortex in tinnitus patients 

C.Wienbruch a , I. Pauf, L. Roberts 13 , N. Weisz a , T. Elbert a 
a Univ. of Konstanz, Dep. of Psychology, Konstanz, Germany 
b McMaster University, Dep. of Psychology, Hamilton, Canada 

Tinnitus is an auditory phantom sensation usually accompanied by hearing loss as a consequence of noise exposure or the process of aging. We 
measured the magnetoencephalogram (MEG, MAGNES™ 2500 WH, 4D Neuroimaging, San Diego, USA) of 28 patients with tinnitus and 18 normal 
hearing controls without tinnitus. Subjects were stimulated binaurally with an amplitude-modulated tone (39 Hz modulation frequency; 180 seconds 
length). Eight carrier frequencies were used (384 576 864 1296 1944 2916 4374 6561) which were applied 60 dB above hearing-treshold (60 dB HL) 
via flexible tubing in the MEG sound delivery system. Two regional dipoles (BESA) were fitted for average data of each frequency.(l dipole per 
hemisphere) Between-group differences in dipole location were statistically examined with SAS (Mixed Effects Model, Variance Components 
estimated with restricted maximum likelihood method (REML) [1]). GROUP, HEMISPHERE and FREQUENCY were fixed effects, 
PATIENT(GROUP) was used as random factor. AGE was added as a covariate of no interest. Among other effects we found a significant interaction 
GROUP*HEMISPHERE*FREQUENCY (F(l,7)=2.23 p<0.034) for the dependent variable location (anterior-posterior). Dipoles for the 6 lower 
frequencies (384-2916 Hz) were fitted significantly more anterior in the right hemisphere compared to the left hemisphere in control subjects (t=- 
3.211 p<0.0016), reflecting a typical asymmetrical processing pattern. This was not the case for subjects with tinnitus . We interpret this as an 
indicator for an altered cortical representation of these frequencies in tinnitus patients 
We thank Willi Nagl for his statistical expertise and suggestion to model the data with mixed models. 

[1] Searle, S.R., G. Casella, and C.E. McCulloch, Variance Components. Wiley Series in Probability and Mathematical Statistics. 1992, New York: 
John Wiley & Sons, Inc. 


572 



ICA Methods for MEG Imaging 

J.E. Moran 1 , 'C.L. Drake, N. Tepley 12 

Henry Ford Hospital, Detroit, Michigan, USA. Oakland University, Rochester, Michigan, USA. 


ABSTRACT 

Activity of individual cortical sources cannot be uniquely imaged when MEG data is a sequence of complex spatial pattern of multiple cortical 
sources. Auxiliary constraints integrated into the imaging equations are required to remove the mathematical ambiguity. Therefore, it is important to 
adapt source separation techniques to MEG imaging. It is much easier to accurately image field patterns of isolated brain electric sources. 
Therefore, we demonstrate how a combination of second and fourth order ICA methods can be used to remove noise and isolate source activity for 
improved MEG imaging accuracy. A second order ICA technique was used to extract respiratory and eye movement artifact by exploiting cross¬ 
correlation differences over time between cortical sources and artifact. For brain electric source separation, a fourth order ICA technique that 
quantified probabilities of simultaneous source activity was used to separate brain electric sources characterized by bursts of oscillatory circuit 
activity. 

INTRODUCTION 

MEG data consists of an unknown mixture of noise, artifact and signals from unknown brain electric sources. Often these sources are compact 
cortical networks that are sequentially activated to perform simple or complex tasks. However, coherent activity across an extended region is often 
simultaneously present with compact source activation or as a primary mode, such as during sleep. Unfortunately, MEG recorded source activity is 
not unique to the true distribution of brain electric sources and contains significant artifact. It is usually necessary to eliminate artifact and desirable 
to extract the spatial-temporal MEG signal for each source prior to imaging source activity. Frequently, popular methods of removing noise, such as 
frequency filtering or signal averaging, are inadequate or inappropriate for removing artifact. However, much of this artifact is due to semi-stationary 
rhythms of respiration, slow eye movements and heart activity that can be separated from brain electric signals using a combination of second order 
(time correlation) and fourth order (activation probability) independent component analysis techniques. Finally, the time course of spontaneous and 
task related brain electric source activity is characterized by bursts of oscillatory activity of limited duration. In addition, sources are usually 
uncorrelated or sequentially activated with limited temporal overlap. This temporal behavior can be exploited by fourth order ICA techniques for 
isolating neuronal source activity and the corresponding spatial pattern representation in MEG array. 

METHODS 

MEG studies: Our 148 channel whole head Neuromagnetometer (WH2500 Magnes, 4D Neuroimaging), was used to measure magnetic fields in 
five individuals for one hour of quiet rest during which they fell asleep. While in a supine position in a quiet dark magnetically shielded room, each 
subject was instructed to remain awake during the first two minutes of the study. Simultaneous recording of ECG, EOG, and C3, C4, Oz EEG 
channels were stored with the MEG data. For each subject 10 seconds of MEG data for awake, transition to stage 1 sleep, and stage 1 sleep were 
selected for imaging cortical activation. These data contained significant artifact associated with respiration and heart activity. MEG data was 
sampled at 508 Hz and initially band-pass filtered 0.1-100 Hz before disk storage. During analysis, these data were further band-pass filtered 0.2 to 
25 Hz. Heart artifact was removed using a fourth order ICA technique applied to the MEG data and utilized the simultaneously recorded ECG. 
Respiration artifact was eliminated using a second order time correlation ICA technique. Estimates of time dependent brain electric network signals 
and corresponding MEG spatial patterns were estimated using a fourth order probability technique. 


ICA Methods: The second order technique used is AMUSE [1]. This technique is useful for separating sources based on changes in second 
order correlation that occur with time. Mathematically this relationship is: 

B and B x are (channel by time = row by column) matrices of MEG data offset by time increment, x. 

B t = a(x)B + [B t - <x(t)B] with a(x) = B t B[BB t ] 1 then: 

BB t T + B t B t = BB T a(x) T + a(x)B x B T with singular value decomposition: B = UkV T (1) 


using the above relationships, the second order ICA components are: 



VLF with VV T + V V T 

ICA T T 





The fourth order technique is implemented in two steps. First a singular value decomposition of the MEG data, B, defined is performed to obtain 
the singular value time series components, V = [v l5 ..., v N ] defined in Equation 1. Noise reduction and elimination of noise components can be 
performed during this step. Second, the fourth order correlations C ^ of the vectors, v f , v y - , \ k , \ h are calculated for all combinations of ij , k and /. 
These correlations can be used to calculate fourth order cumulants [2] related to component density distributions. However, for the orthonormal 
vectors, V, the fourth order correlations can be utilized to obtain fourth order independent components. The goal of the technique used in this 
research is to rotate the singular value decomposition vectors, V, until fourth order correlations, C )jj k , and C ukk have been minimized. The ICA 
technique, JADE [2], has been developed for this purpose. However, we have developed a 4 th order technique that transforms the vector subspace, V, 
to Vica using a sequence of ICA component probability operators, P P (t), (subscript corresponds to the particular ICA component). ICA components 
are only estimates of the unknown brain electric sources, (See [1] for identifiability factors). Therefore, we used the squared amplitude of an ICA 
component, v P (t) 2 , at each instant in time as a measure of the probability that the corresponding unknown source, s P (t), is active. Thus, fourth order 
correlations C ) jkk can be interpreted as the average probability per time increment that sources s y (t) and s*(t) are simultaneously active at time points 
within the time interval, f to t 2 . The fourth order correlations C )jj k address how the sources s y (t) and s*(t) constructively or destructively interfere 
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when they are part of a combined component. Since fourth order separation techniques minimize these cross correlations, the ICA components tend 
to be active during short time intervals that do not overlap, Fig. 1. 


In our technique, the probability operator for any ICA basis vector, v P (t) 
is the diagonal matrix, diag(v P (t) ), with all time components for the time 
interval, to t 2 , on the diagonal. Next, a recursive algorithm is used to 
construct ICA basis vectors, V ICA = [v 1? ..., v P ,..., v N ], such that v P is the 
principal eigen-vector u x with eigen-value = C pppp of the correlation 
operator P P = diag(v P (t) 2 ) in the vector subspace, [v P ,..., v N ]. 

v p is the principal eigen-vector of V p = [v p ,---,v ] 
where (2) 

V P V = k = eigen-value matrix 

P P P ° 


RESULTS 

ECG heart artifact was eliminated from the MEG data by setting P P (t) in equation 2 to the normalized squared amplitude of the simultaneously 
recorded ECG channel. The first ICA component of this 4 th order decomposition corresponds to the heart artifact in the MEG data, Fig 2. The 
second order AMUSE technique is more appropriate for extracting respiration and eye movement artifact which is continuous through the data, Fig. 
3. For MEG imaging, twelve 4 th order ICA components corresponding to neuronal activity were extracted for each of the awake, transition, and stage 
1 data segments. An ICA source component for one subject is shown in Fig. 1. For each ICA component, the MEG array spatial representation was 
imaged using MR-FOCUSS [3,4] and combined to obtain the sequence of activation for all cortical model locations. During sustained wakefulness, 
activity in the occipital cortex and region of the posterior thalamus dominated the subjects averaged brain activation. During the transition from 
wakefulness to sleep onset, this activation pattern changed to one showing greater activation in the right frontal cortex and region of the anterior 
thalamus. During stage 1 sleep, maximum activity was primarily in brain regions near the anterior thalamus. 
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Fig. 3 Respiration and eye 
movement artifact extracted 
using 2 nd order ICA technique. 
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Fig. 2 Left, ECG and MEG data containing extensive heart artifact. Center, ICA heart artifact 
component extracted from MEG data. Left, MEG data with heart artifact removed. 
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Fig-1 A fourth order ICA component extracted from MEG data 
including the transition for awake to stage 1 sleep has the majority 
of component activity localized within a short time interval. 


DISCUSSION 

We have successfully developed a combination of second and fourth order ICA techniques that have enabled the systematic study of change in 
brain activity related to falling asleep. In addition we have found these techniques useful for analysis of spontaneous epilepsy data and complex task 
performance data, such as vehicle driving tasks. In addition, to facilitate the widespread use of these techniques we are developing user interfaces 
and automated procedures for both ICA data cleaning and imaging. 

ACKNOWLEDGEMENT: This research was supported by NIH/NINDS Grant RO1-NS30914. 
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Intersubject Spatial ICA of Temporally Independent Neuromagnetic Sources 

and their Time-Frequency Dynamics 

Ramirez, R. R., Moran, K. A., Kronberg, E., Ribary, U., and Llinas, R. R. 

Department of Neuroscience and Physiology, NYU School of Medicine, New York, NY, USA 


ABSTRACT 

A method for intersubject analysis of magnetoencephalography (MEG) data is developed. The MEG time-series of each subject is individually 
expanded with infomax ICA (EEGLAB Matlab Toolbox) into a complete set of temporally independent activation functions and their associated 
spatial maps. The spatial maps of the independent components are localized using an anatomically and neurophysiologically constrained recursive 
weighted minimum-norm algorithm that finds relatively sparse solutions by minimizing an optimal diversity measure. The cortical independent 
source estimates are morphed into a surface-based spherical coordinate system (Freesurfer). Each hemisphere of each subject is represented on a unit 
sphere by aligning the individual convexity/concavity patterns of the gyri/sulci to an averaged folding pattern. The distributed source estimates of all 
subjects are interpolated to a standard spherical surface-atlas triangulated with relatively uniform segments. Inter-subject spatial ICA is performed on 
these bi-spherical maps to obtain spatially independent sources common to the group. Sources are also clustered based on their time-frequency 
dynamics. Spatial ICA is performed on the vectorized event-related spectral perturbation (ERSP) and/or inter-trial coherence (ITC) of sources of 
interest for all subjects. These methods extract the spatial prototypes of independent sources (i.e., brain modules) and their time-frequency dynamics 
(i.e., ERSP/ITC) on canonical bi-spheres and time-frequency planes. To illustrate this technique, we analyze visual apparent motion data. Results 
demonstrate that we can reliably and objectively map motion-sensitive regions and spontaneous sources (common across subjects), and their 
universal time-frequency dynamics at theta, alpha, beta, and gamma bands. Individual differences can be quantified based on similarity measures. 

KEY WORDS 

Magnetoencephalography (MEG), Neuromagnetic Source Imaging, Independent Component Analysis (ICA), Inter-subject Analysis, Time- 
Frequency Analysis, Event-Related Spectral Perturbation (ERSP), Inter-Trial Coherence (ITC), FOCUSS. 

INTRODUCTION 

One of the most challenging problems in the field of electromagnetic imaging is the effective characterization of inter-subject variability. Even 
though the same neuronal networks may be active in most subjects, the spatial distribution of these brain sources cannot be averaged accurately 
through a simple scaling, rotation, and translation process. For example, due to the highly variable and convoluted patterns of cortical folding, a 
particular retinotopically-defmed area of VI, may be localized to very different Euclidian coordinates for different subjects. Nevertheless, each VI 
forms a topologically continuous cortical patch due to the modularity of cortical organization. Thus, a high-resolution inter-subject average can be 
constructed if all the V1 patches are first unfolded and aligned. A promising approach is to transform the whole cortical surface of each hemisphere 
into a sphere, and to align all individual spheres, based on their gyri and sulci, to an averaged folding pattern [Fischl, 1999]. Once the spheres are 
aligned, the source distributions can be averaged. In this MEG study, we propose an alternative to simple averaging. Temporal ICA is used to separate 
the mixed magnetic signals into maximally independent components, which are then localized separately. Spatial ICA on a spherical brain atlas is 
then performed on the distributed current density estimates of all the independent brain sources of all subjects. To determine what aspects of the 
source dynamics are shared by the group of subjects, spatial ICA is also performed on the ERSP and ITC of all independent components. 

METHODS 

Eight healthy subjects (four females and four males) with no neurological or psychiatric disorders signed consent forms in compliance with a 
protocol approved by the IRB. Magnetic field signals were recorded with a 148-channel neuromagnetometer system (4D Neuroimaging, San Diego) 
while subjects were presented with apparent motion stimuli. The stimuli consisted of two bars of light oriented diagonally and located 2 degrees away 
from a fixation dot along the diagonal of the lower right visual quadrant. The light flash duration was 1 ms and the stimulus onset asynchrony (SOA) 
varied from 0 to 36 ms in steps of 6 ms. All apparent motion stimuli had a left and downward direction. The seven SOA conditions were recorded in 
seven separate runs. Apparent motion stimuli were presented in sets of 5, separated by a resting period. A low volume auditory tone was used to 
indicate the beginning and end of each set. Data were collected in epoch mode with a sampling rate of 1017.25 Hz. Each epoch consisted of a 500 ms 
pre-stimulus period followed by a 750 ms post-stimulus period. The mean of the pre-stimulus period was subtracted from each channel separately for 
DC offsetting. Data were then analyzed with the infomax ICA algorithm (EEGLAB Matlab Toolbox) to find maximally independent components 
[Delmorme, 2004]. The MRI of each subject was segmented, inflated, and registered to a surface-based spherical atlas with the Freesurfer software 
[Dale, 1999] [Fischl, 1999]. The sourcespace was constructed by subsampling a large number of vertices from the high-resolution tessellated cortical 
surfaces. A multi-spherical volume conductor model was used to compute the leadfield matrix. The magnetic field spatial sensor maps (i.e., the 
column vectors of the mixing matrix) were localized separately with an anatomically and neurophysiologically constrained recursive weighted 
minimum-norm algorithm (e.g., FOCUSS) that finds sparsely distributed solutions [Ramirez, 2003] [Rao,2003]. The algorithm parameters were set to 
minimize an optimal diversity measure [Ramirez, 2003]. An a priori threshold on the maximal current density was used to solve the source extent 
estimation problem [Ramirez, 2003]. The estimates of the independent brain sources of all subjects were interpolated to a surface-based spherical 
atlas in MATLAB (Mathworks). Inter-subject spatial ICA of the interpolated source estimates was performed using the infomax ICA algorithm 
(EEGLAB Matlab Toolbox) with a PCA dimensionality reduction. To characterize the time-frequency dynamics of the independent sources, the 
ERSP and ITC (i.e., phase-locking factor) of each independent activation function were computed with the complex Morlet wavelet transform. 
Importantly, this standard transform was modified to allow an increasing number of cycles for increasing frequencies in order to achieve the desired 
time-frequency resolution at high frequencies. Bootstrap confidence intervals were computed based on the pre-stimulus period [Delmorme, 2004]. 
Inter-subject spatial ICA of the ERSPs and ITCs was also performed using the infomax ICA algorithm (EEGLAB Matlab Toolbox) with a PCA 
dimensionality reduction. 
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RESULTS 


Temporal ICA founded independent sources that reacted to apparent motion stimulation, and spontaneous independent sources that did not 
respond strongly to stimulation. The spatial map of many of the independent components had a dipolar structure. Motion-sensitive independent 
sources localized to hMT+, V3A, VI, V2, LG, FG, LOS, IPS, STS, and other clusters in parietal, frontal, and temporal lobe. Inter-subject spatial ICA 
of the independent sources extracted canonical brain modules shared among most subjects. These canonical spatially independent sources were 
spatially non-overlapping and smooth. Inter-subject ICA of the ERSP and ITC extracted canonical time-frequency bursts of power and coherence 
within encapsulated regions of the time-frequency plane. Bursts occurred at the theta, alpha, beta, and gamma frequency bands, and within specific 
time windows. Some of the time-frequency bursts suggested a relationship between harmonics (i.e., octaves). 

DISCUSSION 

Our results demonstrate that inter-subject spatial ICA of independent source estimates can be used to cluster current density distributions into 
functional groups defined by the canonical independent sources of the brain and their Voronoi cells. In our view, each canonical source represents a 
particular brain module or sub-module. These methods enhance our understanding of the modularity of the brain, and its inter-subject variability. 
Inter-subject ICA of the time-frequency dynamics can also help us characterize the inter-subject variability of module-specific responses. However, 
the time-frequency coefficients of the single-trials should ultimately be used for finding canonical single-trial dynamics since the ERSP and ITC are 
just another type of average. 
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ABSTRACT 

Effects of head models on MEG source localizations were studied with three finite element models of the head constructed from segmented MR 
images of an adult male subject. The complexity of the models varied from 9 to 11 tissue types. The lead fields due to dipolar sources in the motor 
cortex were computed for all three models. The magnetic fields were computed at 145 sensor coil locations at 1 cm above the scalp. For a given 
MEG profile, the inverse source localizations were performed with an exhaustive search pattern in the motor cortex area. A set of 100 trial inverse 
runs were made. It was found that the model with most complexity performed best in localizing the sources in the motor cortex area. 

KEY WORDS 

MEG Inverse, Finite Element, Head Models, Source Localization 
INTRODUCTION 

Highly heterogeneous finite element FEM models of the head have recently become increasingly popular for the solution of the forward and the 
inverse problems in EEG and MEG. How different tissue surfaces influence the MEG source localizations is studied here. In particular, we examined 
the effects of CSF, gray and white matter on the inverse source localizations in the motor cortex area. Our results complement previous other studies 
where one has examined how realistic models of the head influence the forward and inverse EEG/MEG simulations. Previous studies with boundary 
element models of the head have examined how volume currents affect the forward EEG/MEG simulations and also their effects on inverse source 
localizations [Fuchs et al., 2001] [Fuchs et al., 2002]. Recently, a five tissue type FEM model of the head has also been used for EEG/MEG 
simulations and source reconstructions [van Uitert et al., 2003] [Wolters et al., 2004]. 

METHODS 

Finite element models of the head were constructed from the segmented MRI (magnetic resonance imaging) slices of an adult male subject. The T1 
weighted sagittal MRI slices with 3.2 mm thickness were collected with a 1.5 Tesla GE Signa scanner. The original MR slices are of 256x256 
resolution with 1 mm size pixels [Haueisen et al., 2002]. A total of 51 contiguous slices were used. Eleven tissue types were identified in the image 
slices and then sub-sampled to a 2x2 mm resolution for forward and inverse simulations. The finite element model of the head has a voxel resolution 
of 2x2x3.2 mm with a total of about 1.5 million voxels. We used four models: 

Model 1: Full model with 11 tissue types. 

Model 2: Model with the conductivity of gray matter was set equal to the white matter, i.e., 10 tissue-type model. 

Model 3: Model with the conductivities of gray matter and CSF were set equal to the white matter, i.e., 9 tissue-type model. 


Refer to figure 1 for model details. The left figure (fig. la) for 
model 1 has all the tissue surfaces intact. In the middle figure 
(fig. 2b) for model 2, distinction between the gray and white 
matter boundaries has been eliminated. Similarly, in the right 
figure (fig. 2c) for model 3, there is no distinction between the 
CSF, gray and white matter tissue boundaries. 

Some of the tissue resistivity values used in the models are: 
gray matter: 3.0; white matter: 7.0; CSF: 0.65; soft bone: 21.8; 
hard bone: 160; scalp: 2.30. All resistivity values are in Ohm 
meter. These values have been used by us before in our head 
modeling studies [Haueisen et al., 2002]. These resistivity 
values were used in model 1. . In model 2, the resistivity of the 
gray matter was set equal to the white matter resistivity (7.0 
Ohm.m) to eliminate the distinction between the gray and white 
matter. Other tissue resistivity values were kept same as in model 1. In model 3, the resistivities of gray matter and CSF were set equal to the white 
matter resistivity (7.0 Ohm.m). 

Using an adaptive finite element solver, The lead-fields at 1 cm above the 145 scalp points were computed for all three models due to unit dipolar 
sources in the motor cortex area. The scalp points were uniformly distributed to cover the whole head and it was assumed that the biomagnetic sensor 
coils were located at 1 cm above those scalp points. The motor cortex within a volume of 3 cm cube was represented by 716 voxels. The lead fields 



Figure 1. Human head models with varying tissue complexities, (a) model 1 with 
11 tissue-types, (b) model 2 with distinction between gray and white matter 
removed, and (c) model 3 with no distinction between CSF regions, white matter 
and gray matter. 
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were computed for x, y and z orientations of the dipoles in the motor cortex. All computations were performed on an Intel 1.7 GHz workstation with 
1.2 GB memory. Each run took about 30 minutes. Post-processing and visualizations were done using Matlab software, version 6.5 (Mathworks, 

Inc., Natick, MA). A 3-D view of the head model and the scalp EEG electrode positions are shown in figure 2. 

The forward problem, i.e., MEGs were simulated at 145 biomagnetic sensor coil positions using model 1. 

For each trial, a unit magnitude dipolar source was placed at a random position in the motor cortex. These 
forward simulated MEGs were then used for inverse source localizations using the lead-fields of three 
different models. Inversions were performed with the least-squares technique. An exhaustive search 
pattern was used, i.e., inversion was performed for each possible source location in the motor cortex and 
the site producing the smallest residual norm was selected as the best possible source location. A set of 
100 trial inverse runs were made. 

RESULTS 

The mean localization errors (MLE) and standard deviations (STD) for 100 trials are: 

Model 1: 1.0=1=1.48 mm, 

Model 2: 1.69± 1.48 mm, 

Model 3 : 1.84=1=1.6 mm. 

These values show that MLE increase as one goes from model 1 to 2 to 3. 

DISCUSSIONS 

These results suggest that head model complexities influence the MEG inverse source localizations. The source localization error slightly increases 
from model 1 to model 2 where the distinction between the gray and white matter has been eliminated. Model 3 has largest MLE as compared to the 
full model, i.e., model 1. In model 3, the difference between the CSF and brain matter was eliminated. The resistivity of the CSF is less than the gray 
and white matter. So in full model the currents will follow the structural paths of CSF channels in the brain tissue. In model 3 this distinction does 
not exist. CSF plays an important role in distributing the currents in a head model which eventually also influences the scalp potentials [Ramon et al., 
2004]. This could also be the reason that model 3 performs poorly as compared to the model 1 in inverse localization of the sources from the MEG 
data. 
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Figure 2. A 3-D view of the head 
model superimposed with 145 EEG 
electrode positions. The biomagnetic 
sensor coils (not shown) are 1 cm 
above the electrode positions. 


578 




MUSIC Seeded Multi-Dipole MEG Modeling Using the Constrained Start Spatio-Temporal 

Modeling Procedure 

Douglas M. Ranlcen 1 , Julia M. Stephen 2 , John S. George 1 
1 Los Alamos National Laboratory, LA, NM, USA ^University of New Mexico, Albuquerque, NM, USA 


ABSTRACT 

The Constrained Start Spatio-Temporal modeling program (CSST) is an objective multi-dipole, multi-start MEG/EEG analysis procedure that 
randomly selects from 100 to 100,000 initial dipole configurations, and runs a nonlinear simplex search on each of these configurations employing 
a reduced Chi-square statistic as the minimization criterion, to obtain a set of dipole configurations that best fit the data [1]. A parallel version of 
CSST is implemented in IDL and MPI, making CSST usable on a single computer, or on a Linux cluster. We have now developed a multi¬ 
resolution version of MUSIC [2] that provides an 80% or more reduction in the number of forward calculations needed to obtain results 
comparable to a 160,000 point MUSIC scan, on a 2mm grid that defines a brain volume. The multi-resolution MUSIC scan provides an improved 
set of initial dipole estimates for the CSST analysis. In preliminary tests on real and simulated MEG data, with model orders ranging between 5 
and 7 dipoles, the best performance improvements were obtained by mixing in 1 to 3 dipole locations randomly drawn from the best MUSIC 
locations, with randomly selected locations from the brain volume to complete the selected model order. We have also developed an improved 
method for sampling the brain volume for initial configurations. These improvements have led to a 75% reduction in the number of starting 
configurations required to obtain 5-10 best solutions with equal or lower reduced Chi-square values, when compared to the best solutions from 
the previous version of CSST. 

INTRODUCTION 

Spatio-temporal multi-dipole methods are a commonly used approach to solving the MEG/EEG inverse problem. The CSST program, based on 
MSST [3], has been tested on complex simulated data [4], and used in analyses of real MEG data [5]. We discuss here two recent enhancements of 
CSST: a new sampling of the brain volume, from which dipole starting configurations are selected; and the use of a MUSIC procedure to obtain 
better starting dipole configurations for the CSST analysis. We also discuss the multi-resolution MUSIC procedure developed for this application. 

METHODS 
Brain Volume Sampling 


Previously, we obtained the set of all dipole starting locations by using a segmentation of cortex. In this 
project, we developed a variable resolution procedure that samples the brain volume with decreasing resolution 
for regions further from the head surface, where MEG/EEG resolvability decreases (see Figure 1). This sampling 
procedure, used to obtain the set of locations from which initial dipole configurations are drawn, has yielded a 
50% reduction in the number of initial starts needed by CSST to obtain comparable results when compared to 
the cortex sampling used in the past. This is probably because the cortical sampling is too fine-grained for 
optimal use with the nonlinear simplex search procedure, which can move dipole locations several centimeters 
over the course of the procedure. We have tested various sampling resolutions, and have found that a sampling 
using approximately 7000 points provides sufficient resolution for both CSST and the MUSIC procedure 
discussed below. 

Multi-Resolution MUSIC MUSIC . 

We have implemented a variant of the classic MUSIC algorithm[2], and use it with the brain volume sampling discussed above. To find the 
peaks in the MUSIC scan, we first sort the locations in increasing order, using the corresponding smallest singular value for each location, when 
projected into the noise subspace. We apply a clustering algorithm to the 100 best MUSIC locations, and increase this number in increments of 50 
until N clusters are found, where N is the estimated dipole model order. For each cluster, the best 4 locations (based on lowest singular values) 
are interpolated, to determine the corresponding dipole location. For several simulations, this coarse MUSIC scan was able to provide adequate 
dipole localizations. In other simulations, the MUSIC solutions were not adequate, but we observed that the solutions were near the correct 
locations. We developed a variable resolution multi-grid method which refines MUSIC localizations by imposing increasingly fine grids around 
solution locations found by a MUSIC scan at the previous grid resolution level. At each level the volume covered by the grids is reduced 
proportionally to the change in grid resolution. We compared this approach to a standard MUSIC scan at 2mm in a brain volume, having 160,000 
grid locations. We were able to localize sources with this multi-grid approach as well or better than the standard scan using fewer than 20% of the 
160,000 MEG/EEG forward calculations required by the standard method. The added computation time for peak finding and grid generation with 
multi-grid MUSIC is negligible, making this approach five time faster than the standard method. This multi-grid approach could be used with 
other variations of MUSIC. These are preliminary results - more work will need to be done to optimize this approach, as well as determine what 
inherent limitations a multi-grid approach may have. 

MUSIC-Seeded CSST 

For difficult MEG/EEG inverse analyses (high model order, low SNR), CSST requires numerous random starts to achieve high confidence that 
a global minimum to the reduced Chi square function has been found. For instance, a six dipole model of MEG visual data may require 15,000 
random starts with a CSST analysis. By incorporating the results of a MUSIC analysis into the selection of starting dipole configurations in CSST, 
we can reduce the number of starting configurations needed to obtain accurate CSST results. In analyses of several sets of simulated data, we 



Figure 1: Volume Sample 
Points used with CSST and 
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observed that the multi-resolution MUSIC results were not as accurate as a CSST analyses of the same data. Also, for data sets with high noise 
levels and/or synchronous activity, a nonlinear simplex search from the MUSIC locations would not converge to the actual dipole locations. We 
expected these problems with our version of the MUSIC algorithm. We believe we can create simulated data that will cause similar problems with 
the other variations of MUSIC, but that is beyond the scope of this paper. 

Our current MUSIC-seeded CSST (MS-CSST) algorithm performs one MUSIC analysis on a variable density grid of approximately 7000 
locations. In the clustering approach described above for finding the MUSIC peaks, we save the best 20 dipole locations in each cluster for use 
with CSST. Our software allows us to try various sampling strategies when mixing in these MUSIC dipole locations with randomly selected 
locations, while creating the CSST starting dipole sets. We first illustrate results for a five dipole MEG simulation, based on an actual 
subject/sensor configuration with a Neuromag 122 system (see Figure 2). We gave the dipoles highly-overlapping timecourses, and created a data 
set with low signal to noise, so that our version of MUSIC would not localize all of the dipoles well. We performed an MS-CSST 5-dipole analysis 
using 100 pseudo-random initial dipole configurations, creating 50 sample configurations with 2 MUSIC locations and 3 random locations, and 50 
samples with 1 MUSIC location and 4 random locations. We analyzed the same data using CSST with no MUSIC-seeding, using 1000 initial 
dipole configurations, saving the best 100 based on reduced Chi square values. We compare the reduced Chi square results for these 2 analyses in 
Figure 3. Comparing the best 26 fits from each analysis, MS-CSST outperformed regular CSST, with 1/10 as many initial dipole configurations. 



Figure 2: Five dipole MEG simulation created using the 
Forward Simulator in MRIVIEW. 
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Figure 3: Reduced Chi square plots for analyses of 5 dipole simulation. 


We have compared the MS-CSST and CSST analyses on several sets of human subject MEG data. Shown below are the results of a 6 dipole 
analysis with MS-CSST and CSST of MEG data for a complex visual task. Compared to 2000 starting dipole configurations for the CSST analysis, 
an MS-CSST analysis using 400 starting configurations gave comparable results for the best 13 solutions, with virtuallyidentical results for the best 
5 solutions. For the MS-CSST analysis we combined 3 dipole locations with 3 random locations for the starting dipole configurations. Figure 4 
shows that the clustering of the best 5 results for both analyses are comparable. Figure 5 shows the corresponding reduced Chi square plots. 



Figure 4: MEG 6 dipole analysis of visual data, MS-CSST best 5 of 400 
starts (left), CSST best 5 of2000 starts (right). 



0 2 4 0 0 10 12 14 

Dipole Solution # 


Figure 5: Reduced Chi square plots for 6 dipole analyses of 
MEG visual data. 


DISCUSSION 

In our analyses of simulated MEG data, MS-CSST outperforms CSST with 90% fewer initial dipole configurations. In analyses of real MEG data, 
the performance improvement has been less, requiring 75 - 80% fewer starting configurations with MS-CSST versus CSST to produce comparable 
results for the best 5-10 solution sets. We are continuing to explore methods for incorporating the MUSIC analysis results in the CSST starting 
dipole sets, to provide a better set of initial dipole locations while maintaining the stochastic nature of the CSST procedure. This will include 
incorporating the spatial variance of the MUSIC dipole clusters in the CSST starting dipole selection process. We will also be trying other 
variations of the MUSIC algorithm, to determine which provide the closest results to a full CSST MEG/EEG analysis with human subject data. 
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ABSTRACT 

Spatial filtering methods for localizing brain electrical activity, such as linearly constrained minimum variance (LCMV) filtering, have shown 
promise in MEG applications. In this paper we employ recent results in estimation theory to predict mean squared error (MSE) localization performance 
of an LCMV algorithm for MEG. The MSE localization performance realized by spatial filtering methods is known to be a strong function of signal-to- 
noise ratio (SNR), the number of data samples available for covariance matrix estimation, and signal modeling errors. Below the so-called estimation 
threshold SNR, the location MSE rises rapidly due to the increased impact of false peaks in the LCMV cost function. This analysis explores the 
potential of a technique of MSE prediction that has obtained remarkably accurate predictions of MSE localization performance of nonlinear estimation 
schemes. The canonical case of a single dipolar source with known moment in white noise is considered. The MSE predictions account for finite data 
sample effects, cover regimes of very low SNRs that include the threshold SNR, and hold the potential to account for signal modeling errors. 

KEY WORDS 

MEG, source localization, mean squared error, linearly constrained minimum variance filters, threshold SNR. 

INTRODUCTION 

Spatial filtering methods for localizing brain electro-magnetic activity have been developed in recent years [Van Veen, et al., 1997, Dogandzic, 
et al., 2004]. These techniques exploit the spatial (and temporal) coherence of arrays of EEG and MEG sensors that measure the electric potential 
on the scalp and the magnetic field around the head, respectively and reduce the effects of background noise (interference and correlated sources) 
without distorting the signal of interest to provide high-resolution localization of neuronal activity in the brain. It is desired in this analysis to explore 
the potential of recent results in estimation theory [Richmond, 2004] to predict the mean squared error (MSE) of location estimates obtained via 
linearly constrained minimum variance (LCMV) spatial filtering [Van Veen, et al., 1997] The MSE localization performance realized by spatial filtering 
methods is known to be a strong function of signal-to-noise ratio (SNR), the number of data samples available for covariance matrix estimation, and 
signal modeling errors [Van Trees, 2002, Richmond, 2004]. Below the so-called estimation threshold SNR, the location MSE rises rapidly due to the 
increased impact of false peaks in the LCMV cost function. This present work considers the MSE prediction of the MEG based LCMV location error 
for a single dipolar source of known moment orientation over a range of SNRs that includes the threshold region. 


LCMV SPATIAL FILTERING 

Let the iV x 1 column vectors 1 x(/) for / = 1,2,. 
dipolar source in noise: 


., L be the magnetic field strength measurements at N sensor sites on the scalp due to a single 
x(Z) = H(0^)m • S + n(Z) (1) 


where H (Ot) is the N x 3 matrix whose columns represent solutions of the forward problem for the dipole source located at spatial position Ot', m 
represents the 3x1 dipole moment component that will be assumed known and fixed over the observation time; the scalar A is a parameter that will be 
used to vary the measured signal strength (SNR); n(/) represents the background noise. 

c\ 

Let h(0) = H(0)m be the forward solution for a dipole at location 0, and let parameter S be zero mean with variance <j$. Let the data covariance 
be denoted by U{xx } = R = + <7gh(0^)h (Ot)- The LCMV method minimizes of the output power of a linear filter w constrained to have 

unit gain at the location of the assumed source position of interest. Using the set of vector observations X = [x(l)|x(2)| • • • |x(L)] to estimate the 
data covariance via R = XX , localization is accomplished by scanning the resulting power estimate (normalized by the noise contribution) over the 
location variable 0: 

h T (0)R -1 h(0) N 


Plcmv(O) = 


L-N 


(h^(0)h(0)) 


Ot — argma xPlcmv(O)- 

0 


( 2 ) 


o 

The maximum output provides an estimate of the signal power a s and the source location estimate is given by the scan value of 0 that achieves this 

maximum. The LCMV cost function 'ipLCMv(O) is obtained by replacing R with the true covariance R in the expression for Plcmv(O). Note that 
at high enough SNR, Ot is the location of the global maximum of MV ( 0 ). 

MEAN SQUARED ERROR PREDICTION 

The interval error based approach to MSE prediction applied herein decomposes the MSE expression into two components: “no interval errors” 
(NIE), and “interval errors” (IE). The parameter search space, i.e. the scanning domain for 0, is divided into disjoint mutually exclusive neighborhoods 
based on the characteristics of underlying LCMV cost function and interprets the resulting MSE in terms of these neighborhoods. When location 
estimates fall inside the neighborhood containing the global maximum Ot , then it is said that “no interval errors” have occurred. Such estimates result 
in the smallest contributions to the MSE, often referred to as local errors. This “no interval errors” component represents the contribution dominating 
the MSE within the asymptotically high SNR regime. The large sample local error MSE approximations of [Vaidyanathan, et al., 1995] are used for 
this “no interval errors” component contribution and will denoted by Yiyb(0t)- 


^he notational convention adopted is as follows: italics indicates a scalar quantity, as in A; lower case boldface indicates a vector quantity, as in a; upper case boldface 
indicates a matrix quantity, as in A. The matrix transpose is indicated by a superscript T as in A T . 

*This work was sponsored by the Defense Advanced Research Projects Agency under Air Force contract F19628-00-C-0002. Opinions, interpretations, conclusions, 


and recommendations are those of the author and are not necessarily endorsed by the United States Government. 
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LCMV Cost Function 

- based on 74 Channel Magnes II Biomagnetometer 

- SNR = -23 dB 


Dipolar source in center of Somatosensory Region 



Figure 1: (a) LCMV cost function plotted on inflated cortical surface, (b) Monte Carlo Based MSE, and Theoretical MSE Predictions 


When location estimates fall outside the local neighborhood of the true peak 0^, then it is said that an “interval error” (IE) has occurred. This results 
as a consequence of the LCMV algorithm choosing maxima of the Plcmv (0) that can be traced to false peaks (those local maxima not corresponding 
to 0?) of the underlying cost function. This component dominates in the low to very low SNR regimes and must be approximated numerically. Let all 
LCMV cost function local maxima within the search space of interest be given by the finite set = { 0 | 0i, 02, • • •, 0 m} where Q\ — 0 t is the 
true peak location and 0^ for k = 2,3,..., M are all other local maxima; The total MSE for the LCMV location estimate can be approximated by 

M 1 M 


= E S (01 — 0i ) (01 — 01 


T 


m- 


1 — "y ^ p ^0i — 0mj ■ Ey£>(0 1 ) + 'y ^ p ^0i — 0mj (0m — 0l) (0m — 0l) 


T 


(3) 


m- 


where the “no interval errors” component is described by the results of [Vaidyanathan, et al., 1995], and the “interval error” component has been 

discretized. The interval error probability p (Oi = 0 TO j represents the likelihood of the LCMV search algorithm choosing the false peak located at 

0 = 0m as an estimate, when the true signal is located at parameter value 0 = 0 1. As in [Richmond, 2004], the dominant term of the Union Bound 

sum can be used to approximate the interval error probabilities p ^0^ = 0 m ^ — Pr [PLCMvi^m) > PLCMvi^k)\ • Exact finite sum expressions 

for these probabilities under a complex Gaussian data assumption are given in [Richmond, 2004]. These results have been adapted in this work for the 
real Gaussian data assumption. 


NUMERICAL RESULTS 

Source localization is constrained to the cortical surface extracted from a subject’s MRI using FreeSurfer [Fischi, et ah, 1999]. The dipolar 
source is placed near the center of the somatosensory region at 0 t — [0.00225,0.04307,0.09936] meters relative to the origin. The forward 

solutions are based on sensor locations of BTi’s N = 74 channel Magnes II Biomagnetometer. The SNR of the dipole source is defined as SNR = 


tr [<rfh(0 T )h T (0 T )] /tr [all 


9 p 

= —#■ | |h(0y) 11 and varied in simulation by choice of erg. The realized MSE (truth) is obtained by averaging 11500 

Monte Carlo simulations of the search defined by equation (2) over the desired range of SNR values. The total MSE (trace of error covariance) is 
plotted in Figure 1 (b) along with the theoretical MSE predictions based on (3) for varied sample sizes L. These results are very encouraging as the 
predictions accurately point to the threshold SNR value and reflect the impact of variation in the data sample support L. The threshold SNRs occur at 
about — 14dB, — 17dB, and — 18.5dB for sample sizes L = 1.5 V, 2V, and 3V respectively and are well predicted by the theory. Note that a total root 
MSE of about 0.1 cm requires an SNR of about —5dB for L = 2 V. 

The benefit of this analysis is that it ties signal strength to localization accuracy for LCMV methods over a wide range of SNRs. It, therefore, allows 
one to assess LCMV performance under a wide range of physically meaningful conditions. 
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ABSTRACT 

Synthetic aperture magnetometry (SAM), a scalar minimum-variance beamformer, has been used to image source power or source signal-to-noise 
ratio from MEG. However, the locations of maximal event-related oscillatory activity (or changes from resting state) do not necessarily coincide with 
those sites that are phase-locked to external events (i.e., localized by dipole fit to the averaged evoked response). Since an estimate of the source 
time-series may also be obtained by applying the beamformer coefficients to the MEG signal, one can image event-related activity by mapping some 
function reflecting the reliability of the averaged source waveform at each location. We have devised a new analysis method, SAM (erf), for obtaining 
a functional image of event-related brain activity and revealing the corresponding waveforms for sites that are activated. The mapping function used 
in SAM (erf) is the ratio of RMS amplitude of the averaged source waveform to that of the ± average waveform, within a selected response time 
window. This function is computed at each coordinate on a three-dimensional grid throughout the head. In addition to the SAM (erf) functional image, 
the averaged source waveforms for each of the local maxima in the image can be computed and displayed. This procedure can reveal multiple 
locations and waveforms at sites in the brain engaged in event-related activities. When this method is applied to evoked response studies, phase- 
locked activity can sometimes be found in areas distant from primary sensory cortex. For example, in a somatosensory evoked response study, event- 
related activity is also seen in the anterior cingulate gyrus and cerebellum. Given the sensitivity of this functional imaging method to areas outside 
primary sensory cortex, it has the potential for detecting subtle changes in brain activity in health and disease. 

KEY WORDS 

Magnetoencephalography, functional brain imaging, event-related activity, minimum-variance beamformer, synthetic aperture magnetometry, 
phase-locked response. 

INTRODUCTION 


“Source localization” implies simplification of the complex activity of a very large numbers of neurons to a few parameters that help describe that 
activity. Those parameters represent a model that may be used to describe the gross functional organization of the brain and distinguish between 
normal and abnormal activity. All models are biased by their prior assumptions. Bias is introduced by selection of both source model and analysis 
methods. For electrophysiological brain measurements, two basic assumptions have been made regarding the association of cortical activity to 
events: 1) a portion of cortical activity is phase-locked to the events; 2) cortical rhythm is altered (either synchronization or desynchronization) with 
the events. These properties are, of course, readily demonstrated experimentally. The most common approach is to compute the averaged event- 
related signal (assuming stationarity), and fit one or more equivalent current dipoles (ECDs) to each peak. Here, an assumption is made that each 
peak in the averaged waveform represents a source or sources that are sequentially activated. While this is certainly true for some subcortical 
structures (e.g., the brainstem auditory evoked response), neocortical evoked responses generally involve parallel activation, overlapping in time. 
More recently, minimum-variance beamformers such as SAM have been used to characterize the spatial distribution of event-related changes in 
cortical rhythm within a specified frequency range and time-window, relative to the events. This analysis does not require that cortical activity be 
phase-locked to external events, and makes no assumptions as to the number of sources activated. However, since temporal resolution is inevitably 

compromised while measuring time-windowed source power, 
localizing the earliest events (i.e., the primary sensory or motor areas) 
is not assured. 

Phase-locked and rhythmic changes localize to approximately the 
same sites for each sensory or motor modality. Is this sufficient 
reason to believe that their sources are identical, or does this merely 
show different aspects of a more complex activity? To answer this 
question, one must devise an analysis that can reveal this relationship. 
By combining the advantage of SAM (i.e., estimation of source 
activity from multiple sites) with those of signal averaging, a 
mapping function can be derived that discriminates between phase- 
locked source activity and unrelated signals. Although SAM is 
usually used to measure source power at any given location in the 
brain, it can also estimate source waveforms. When the amplitude of 
an averaged source waveform is significantly larger than its 
corresponding ± average, one has statistical evidence that this site is 
engaged in event-related activity. By mapping this ratio at points 
throughout the brain, a functional image can be derived showing the 
spatial distribution of phase-locked brain activity. Addition of a low- 
pass filter into this analysis helps accommodate the presence of jitter 
in the evoked response. This procedure is referred to as “SAM (erf)” - 



METHODS 


Figure 1. Event-related imaging flowchart. SAM is used to compute SAM imaging of an event-related field mapping function, 
beamformer coefficients at regular intervals throughout the region of interest 
(ROI). At each location, source waveforms for the average and ± average are 
generated, and the ratio of event-related activity to background noise is The SAM (erf) imaging procedure is comprised of several steps 
determined. This results in a functional image in which the maxima indicate (shown in Fig. 1). A three-dimensional region of interest (ROI) is divided 
sources with activity that is time-locked to external events. into a re gular grid of coordinates. At each coordinate 6 , the SAM 
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Figure 2. Comparison of source waveforms (SAM virtual 
sensors) of average (black traces) and ± average (grey traces) 
for somatosensory evoked response to median nerve 
stimulation. The ratio of the averaged waveform to the ± 
averaged waveform is large for locations where activity is 
time-locked to the stimulus, and smaller elsewhere. The 
vertical line indicates the stimulus trigger. 


beamformer coefficients are computed from the covariance C of the 

unaveraged MEG data and the lead field [Robinson and Vrba, 1999], using 
the expression: 

w, =c- 1 b # (bJc- , b,)- 1 . 

These coefficients are applied to the band-limited average and ± average (the 
latter denoted by a bar through its symbol) of the same data, yielding source 
waveforms for the phase-locked and background source activity: 

S e (i) = W„ T m(?) and $ e {t)= WjM (t), respectively. 

Assuming stationarity, the ± average estimates the portion of the averaged signal 
remaining after cancellation of the phase-locked components. It is a measure of 
how much of the averaged source activity was unrelated to the external event. 
Next, the baseline offset is removed from the source waveforms, using the mean 
of the pre-stimulus interval (or any other time window defined as the baseline). 
The ratio of the average to the ± average, within a predefined response time- 
window indicates the degree to which that location is phase-locked to the event. 
In practice, this ratio (a variant of signal-to-noise ratio, or SNR) is computed from 


the RMS amplitudes of the two waveforms. Sites (voxels) at which local activity is phase- 
locked will have a large SNR, while sites that are unrelated to the event will have a small SNR 
- approaching unity in the limit where there is no event-related activity (illustrated in Figure 
2). The SNR value at each voxel forms a functional image showing the spatial distribution of 
phase-locking to the selected external event. Beamformer coefficients for the local maxima of 
this “SAM (erf)” image are then used to view the signal averaged source waveforms related to 
the event. 

This method was applied to somatosensory MEG data, collected with a 275-channel whole 
head instrument (CTF-275). Data were sampled at 1200 Hz in an unshielded environment, 
with a dc to 300 Hz passband. Electrical stimulus was applied to the right median nerve at 3.1 
Hz for 400 trials. 

RESULTS 

The SAM(eV) image found local maxima in both left and right hemisphere, with the 
largest ratio appearing at two sites along the left central sulcus. Although about 2.1 cm apart, 
these two sites differed in latency and waveform morphology. A sample of the averaged source 
waveforms, relative to the overlay of the average of all 275 MEG sensors, is shown in Figure 
3. The earliest peak latency, corresponding to the N 2 o was found in the anterior cingulate 
gyrus. An evoked response was also visible in cerebellum, with a peak latency of about 27 ms. 
All averages shown were much larger in amplitude than their respective ± averages, indicating 
that these were reliable regions of local phase-locking to the median nerve stimuli. 

DISCUSSION 

The relationship of the averaged source waveforms to the averaged MEG sensors is not 
immediately obvious. For example, the field map for each of the peaks of the sensor average 
could best be explained by a single dipole, whereas SAM (erf) explains the same data as the 
superposition of activity at multiple sites, with differing peak latencies. It is not surprising that 
the traditional technique of fitting a dipole to each peak of the averaged evoked response has 
given rise to the illusion of pure serial processing of sensory and motor activity. Jitter in the 
evoked response, relative to the event trigger, attenuates the higher frequency components of 
the averaged signal. Reducing the lowpass cutoff in the SAM (erf) analysis to 30 Hz revealed 
the presence of additional sources of SEF event-related activity, including a source in an 
associative area of occipital cortex. Thus lowpass filtering provides a convenient means of 
discriminating among sources with varying degrees of phase lock to an event. Manipulation of 
both stimulus and task could be used to reveal the functional relationship of the occipital 
source. 

We have shown, by example, that interpretation of event-related brain activity depends 
upon the source model and analysis. SAM (erf) analysis reveals parallel processing pathways, 
while not requiring specification of the model order. This complexity is not easily 
demonstrated when applying other analysis techniques. The SAM (erf) method has potential 
for being a sensitive method of showing subtle changes in event-related source activity in 
health and disease. 

REFERENCES 

Robinson, SE, Vrba J.. Functional neuroimaging by synthetic aperture magneto- 
metry. Recent Advances in Biomagnetism 1999, 302-305. 


Time (ms from trigger) 

24.2 30.0 35.8 48.3 68.3 

^ \ I / \ 



10 ms 


Figure 3. Event-related beamformer source 
waveforms reveal components of the evoked 
response to median nerve stimulation that are 
not apparent in the average of MEG sensors 
(overlay plot, top). The earliest peak appears in 
anterior cingulated gyrus (b). This is followed 
closely by events at two sites on the central 
sulcus (c) and (d). A response is also seen at 
about 27 ms in cerebellum (a). 
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ABSTRACT 

We have previously proposed mapping data into an appropriately chosen subspace prior to performing linearly constrained minimum variance 
(LCMV) source localization. Reducing the dimension of the data increases the statistical stability of the data covariance matrix estimate and results 
in less variability in LCMV source localization. In this paper we apply the bootstrap method to somatosensory evoked response data to evaluate the 
repeatability of original and subspace LCMV source localizations as a function of the number of data samples used to estimate the covariance matrix. 
The original 74-dimensional data is mapped into a 13-dimensional subspace designed to span the space corresponding to signals originating within the 
somatosensory cortex of the subject, as identified from an MRI. The subspace mapping provides a very significant reduction in localization variability 
even though the location is not restricted to the somatosensory region when searching for the maximum of the LCMV cost function. For example, 
given 150 data samples for estimating the covariance matrix, over 97% of the 1000 subspace bootstrap resamples are within 1cm of the mean location, 
while only 50% of the original LCMV resamples are within 1cm of the mean location. These results provide compelling evidence of the efficacy of the 
subspace approach. 

KEY WORDS 

MEG, subspace mapping, dimension reduction, source localization, linearly constrained minimum variance filters, bootstrap. 

INTRODUCTION 

Linearly constrained minimum variance (LCMV) source localization [Van Veen, et al., 1997] is based on a map of estimated power as a function 
of location. The estimated power is determined based on an estimate of the data covariance matrix. The quality of the data covariance matrix estimate 
determines the variability in source localization. It is known that a large number of independent data samples describing the phenomenon of interest are 
required to obtain high quality estimates of the data covariance matrix, e.g., at least three to five times the number of sensors [Reed, et al., 1970]. For 
many MEG applications this requirement is difficult to achieve. For example, a modest 74 sensor array would require over 225 statistically independent 
data vectors to obtain a good covariance matrix estimate. We have shown that this requirement is relaxed by mapping the data into an appropriately 
chosen subspace prior to performing LCMV localization [Van Veen, et al., 2002]. In this paper we apply the bootstrap technique to demonstrate the 
tremendous reduction in source localization variability that is obtained with subspace localization. Bold lower and upper case symbols denote vectors 
and matrices while superscript t and — 1 represent matrix transpose and inverses respectively. The trace of a matrix is written as tr A and the symbol I 
denotes the identity matrix. 


METHODS 


Let the N x 1 column vector x. J (k) represent the magnetic field strength measurements at N sensor 
sites for epoch j and time k. We assume a total of J epochs and K time samples, so JK data vectors are 
available, each of which consists of L dipolar sources and noise as shown by 


L 


yj (k) = H(0^)m] (/c) + n \k) 


( 1 ) 


i— 1 



The columns of the N x 3 matrix H (Oj) represent solutions of the forward problem for unit amplitude 
dipoles in the x, y and z directions at location 0^, (k) represent the 3x1 dipole moment associated 

with a given source, and the N x 1 vector n J (k) represents noise. In a typical MEG evoked response 
experiment multiple epoch time series are obtained where the underlying source configuration is assumed 
the same across epochs. 

Let the M-dimensional subspace data vectors be z J (k) = T^x- 7 (k) where the subspace mapping matrix 
T is designed as described in [Van Veen, et al., 2002] to represent signals originating from a predefined 
sector of interest in the cortex, e.g., the somatosensory cortex shown in Figure 1, with minimum source 

representation error. Here representation error is the norm of the difference between the forward solution H(0) and the subspace projected version 
TT*H(0) averaged over all 0 in the sector of interest. Source localization via subspace LCMV is performed by evaluating the neural activity index 
cost function [Van Veen, et al., 1997] 

tr [U t (0)TC~ 1 (z)T t U(0)] 


Figure 1: Somatosensory cortex selec¬ 
tion for subspace design depicted on the 
inflated cortical surface. 


-l 


NI(0) 


-l 


( 2 ) 


tr [U t (0)TQ- 1 (z)T t U(0)] 

where C(z) is the subspace data covariance matrix, Q(z) is an estimate of the subspace noise covariance matrix, and the columns of the matrix U(0) 
are an orthonormal basis for the space spanned by the columns of H(0). Local maxima in the neural activity index correspond to source locations. 
Localization in the original space is performed using T = I in (2). The data covariance matrix is estimated using the sample covariance matrix 


J K 


C(z) = 


JK 



rJ (k)z J (k) 


(3) 


j =1 k=l 
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Epochs 

Dist(cm) 

300 

150 

100 

50 

0-1 

67/97 

49/88 

39/80 

— / 38 

1-2 

17/1 

22/11 

23/4 

- / 30 

>2 

16/2 

29/1 

38/16 

-/32 


A) Multiple time samples 



Epochs 

Dist(cm) 

300 

150 

100 

50 

0-1 

79/100 

50/98 

30/93 

-166 

1-2 

13/0 

26/0 

27/2 

- / 21 

>2 

8/0 

23/2 

43/5 

-/ 13 

B) Sing 

e time sample 


Table 1: Percentage of bootstrap resamples for which the source is localized within the specified distance from the mean location for the origi¬ 
nal/subspace LCMV methods. 


The bootstrap method [Efron and Tibshirani, 1996] creates multiple data sets by 
re-sampling with replacement from the set of available data vectors. The statistics 
of the source localization estimates are estimated from the sample statistics of the 
bootstrap resamples. 

The MEG data is recorded with a 74 channel SQUID magnetometer (Magnes II 
Biomagnetometer Technologies). Three hundred 500ms long somatosensory evoked 
responses are collected by pneumatically stimulating the right hand index finger of 
a healthy adult female. A 520.8 Hz sampling rate, 0.1 Hz high pass, and 100 Hz 
low pass in line filters are used during the recording. The neural activity index for 
the subspace and original cost functions are constrained to the gray matter surface 
extracted using FreeSurfer [Fischl and Dale, 1999] from an MRI scan. The source 
location is chosen as the location out of 120, 000 coordinates on the left hemisphere 
corresponding to the maximum value of the neural activity index. The subspace 
mapping T used in the experiments is designed to represent the somatosensory cor¬ 
tex in the left hemisphere shown in Figure 1. A 13 dimension subspace is chosen. 

RESULTS 

Data covariance matrices are estimated using both multiple epochs from a single 
time sample, and multiple epochs with multiple time samples. For the single time 
sample case we use data at 78.7 ms, see Figure 2. For the multiple time sample case 
we use data from 67.2 ms to 105.6 ms, see Figure 2. For both cases we estimated 
the covariance matrix using 300, 150, 100 and 50 epochs. 

Table 1 shows LCMV source localization results based on 1000 bootstrap re¬ 
samples for the multiple and single time sample experiments. The number of exper¬ 
iments where the source is localized within a given distance (0 — 1 cm, 1 — 2 cm, 
> 2 cm) of the mean source location is computed to assess the variability of each 
method. Notice that in Table 1 the original LCMV method does not include the 50 
epoch case because the estimated data covariance matrix is singular. This occurs 
whenever the number of data vectors used to estimate the covariance matrix is less 
than its dimension. The mean locations are generally biased towards the motor strip 
as illustrated in Figure 3. The original and subspace mean locations are very similar, 
so the subspace approach does not introduce significant bias. 

DISCUSSION 

The results demonstrate that mapping the data into an appropriately designed 
subspace dramatically reduces the variability of LCMV source localization as a con¬ 
sequence of the increased statistical stability of the lower dimensional subspace co- 
variance matrix. Furthermore, the subspace approach is advantageous in situations 
with limited data since subspace LCMV localization can be performed when the 
original LCMV localization is undefined. 
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ABSTRACT 

The inverse electrostatic problem is generally underdetermined, with many solutions. Many algorithms exist for choosing a solution, and 
appropriate methods are application dependent. Most require a complete, or nearly complete, sampling of the candidate source space at the 
resolution desired for the inverse. This is a reasonable requirement for analytic forward models of the head, such as concentric spheres, but is 
computationally intense for numerical models of the head, such as those that make use of the finite element method. This study presents an approach 
that can isolate a spatially concentrated region of source activity from a sparse sampling of the candidate source space, thereby minimizing the 
number of forward computations in the head model. Our results show localization performance comparable to an exhaustive search with forward 
solutions on 11% of the candidate cortical sites. This technique is therefore highly applicable for use with anatomically-realistic, patient-specific 
models in applications with spatially concentrated source activity, such as the localization of epileptic foci. 

KEY WORDS: Electroencephalogram, Inverse, Finite Element, Head Model, LORETA 

INTRODUCTION 

Localization of current sources in the brain from the electroencephalogram (EEG) requires a solution to the inverse electrostatic problem, which 
can be reduced to solving a mapping of the form: 

Ls = v (i) 

where v is a vector of length m representing electric potential measurements at an array of scalp electrodes, s is a vector of length n representing the 
source amplitudes, and L is an m x n matrix “lead-field” matrix. Matrix L may be approximated by simulations of current flow in the head, which 
requires a solution to the Poisson equation. This creates few practical difficulties in analytic models of current flow, can be computationally intense 
in some numeric approaches. In this application n is typically much larger than m, and (1) represents an underdetermined and ill-posed problem with 
many solutions. There are many approaches for selecting a pertinent solution. For example, the minimum-norm solution will generally contain a 
wide distribution of sources. Such solutions are not satisfactory in applications where the sources are known to be more spatially concentrated. One 
possibility is to choose the solution that minimizes a weighted norm. If a numerical forward model is used, a difficulty in applying such techniques is 
that they require the entire matrix L. At millimeter resolutions, this can easily represent tens of thousands of forward solutions. 

In cases where it is known, a-priori, that the desired solution is spatially concentrated, it is natural to ask whether one can start with a course 
sampling of the potential source space, compute an inverse, and use some feature such as the residual error as the basis for a more densely sampled, 
yet spatially constrained inverse, in an iterative fashion. The ill-posed nature of the inverse makes this a difficult proposition, even when there is only 
a single source, because the residual error surface is nonlinear. Thus the location producing the minimum residual norm at a particular iteration is not 
necessarily (or even usually) the closest to the actual source, and such a procedure can easily be lead astray of an actual source location. What can 
help is the same spatial smoothing that is the basis for many weighted norm approaches. We desire a spatial smoothing such that the peak or first 
moment of the inverse can lead to the correct spatial region for refinement. Spatial smoothing is often implemented as differencing operations over 
some neighborhood of each source location. Such operators are cumbersome to apply over irregularly sampled spaces. The technique that we 
illustrate here has the additional advantage of producing the desired source smoothing without requiring the definition of a spatial operator. 

In this study we test such an approach using an anatomically-realistic head model. We compare the localization performance of this method to 
that of an exhaustive search over all potential source locations. An exhaustive search provides a reasonable performance benchmark because it is not 
possible to do better than an exhaustive search when the number of sources is known, and comparing performance relative to an exhaustive search 
allows us to factor out the effects of other considerations, such as EEG sampling density and approximation errors in the forward model. 

METHODS 

We used a realistic representation of the conductor volume based on 
magnetic resonance (MR) images of a human head. The model contained 
11 tissue types at a resolution of 2 x 2 x 3.2 mm. The modeled domain is 
illustrated in Fig. la, along with the 145-lead EEG configuration. The 
cortical surface was sampled with an average spacing of 6 mm, producing 
2035 candidate source locations shown in Fig. lb. The lead-field was 
calculated using the finite element method. 

The spatial refinement algorithm, described below, was compared to an 
exhaustive search algorithm over a signal-to-noise ratio (SNR) from -10 dB 
to +30 dB, in 5 dB increments. For each algorithm, 50 trial inverses were 
run. For each trial, a unit dipolar source was placed at a randomly selected 
location, with a random orientation. The forward problem was then 
simulated and observations taken at the EEG sample points. White 
Gaussian noise was added to each channel in order to achieve the desired SNR. Statistics were collected on the distance error between the actual 
source location and the output of the inverse algorithm, and plotted against SNR. 

The exhaustive search algorithm simply iterated through each possible source location and performed a least-squares fit of a dipolar source at that 
location to the observed data. The location producing the smallest residual norm was selected as the estimated source location. The spatial refinement 
algorithm is summarized in Fig. 2. We began by defining a set of successive decimation ratios (dj) on the set of candidate sources. The initial ratio 
was selected to produce a targeted number of total sites selected by the user (n 0 ). Columns of the original lead-field were then decimated to the 
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Figure 1. (a) Head model and EEG points (b) Cortical samples. 
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corresponding source locations (L^, as was the array of source coordinates 
(li). The decimation ratios for successive iterations were reduced by not 
more than a factor of 4 in order to achieve an approximate doubling of 
resolution in the source space. At each iteration, an inverse on the current 
source was computed using the sLORETA algorithm [Pascual-Marqui, 

2002]. This algorithm uses a Tikhonov-Philips regularization and we chose 
the regularization parameter using the discrepancy principle [Hansen, 1998]. 

The source with maximal “normalized” power was then selected as the 
center point for spatial refinement in the next iteration. Successive iterations 
included sources within a spherical region at successively higher resolutions 
determined by the next decimation ratio. At each iteration the radius was 
chosen such that the sphere included not less than the same targeted number 
of source sites, and as near to the targeted number as possible with unit 
changes in the radius. 

RESULTS AND DISCUSSION 

Fig. 3 compares the mean location error of the spatial refinement 
algorithm to an exhaustive search vs. SNR. Vertical bars indicate the 95% 
confidence interval. The overlapping intervals indicate statistically 
indistinguishable performance in terms of the mean error. In this example 
n 0 =56, and the average number of sites for which forward computations were 
required was 222. All 2035 sites were examined in the exhaustive search. 

For unconstrained dipole orientations, each site requires 3 forward 
computations, so the algorithm required 666 computations compared to 6105 
for the exhaustive search. Thus the spatial refinement algorithm performed 

the inverse localization as well as can be done using only 11% of a full- 
resolution lead-field Figure 2. Flowchart of spatial refinement algorithm. 

The number of forward solutions can be reduced by reducing the targeted number of 
source sites per iteration, n 0 . It is reasonable, however, to expect that such reductions 
will have an effect on the localization performance. Fig. 4 illustrates the results when no 
was reduced to 32, which reduced the number of total sites visited to 128. The 
worsening performance at higher SNRs is explained by the fact that a reduction in the 
targeted number of sites resulting in an over-determined system will drive the 
regularization towards a standard least-squares fit when the noise level is small, but not 
when the noise level is large. This eliminates the smoothing benefits of sFORETA, and 
the highly nonlinear shape of the error surface then leads the algorithm astray of the 
actual source location. The targeted number of sites must therefore exceed the number 
necessary to maintain an underdetermined. Because this study used unconstrained 
dipole orientations, s contains three degrees of freedom per source location, and L 
contains three columns per source location. In this case, we must therefore include more 
than (m/3) candidate sites at each iteration. The required number of forward 
computations is thus linked to the number of EEG observation points, m. Knowledge of 
the localization performance vs. EEG sampling density is therefore of interest with this 
algorithm. 

A limitation of this algorithm is that it is targeted towards the localization of a single 
source, or a local concentrated region of activity. In the latter case this algorithm could 
be used to provide a locally constrained initialization of a higher resolution algorithm 
such as FOCUSS [Gorodnitsky et al., 1995]. 

There are other approaches that do not require forward solutions to the entire source 
space, such as: Nelder-Mead simplex, simulated annealing, and genetic algorithms. 
Simplex can be trapped in local minima of the goal function, which is generally based on 
the residual norm. Statistics on the number of sites visited by an algorithm are rare, but 
there is evidence that simulated annealing and genetic algorithm approaches tend to 
explore a much larger portion of the source space than the algorithm presented here. 
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ABSTRACT 

Distributed solutions to the electrostatic inverse problem provide an estimate of source activity at every possible location. This leads to a highly 
underdetermined and ill-posed problem with multiple solutions. Algorithms that choose a particular solution often favor one of two extremes: they 
provide either a relatively high resolution image of a few sources, or a low-resolution rendering of primary regions of activity. This paper presents a 
refinement of the FOCUSS algorithm that improves the balance between these two extremes. Our results show perfect reconstruction of a complex 
source configuration on noise-free data, and a reconstruction superior to FOCUSS on noisy data. 

KEY WORDS: Electroencephalogram, Inverse, Finite Element, Head Model, FOCUSS, LORETA 

INTRODUCTION 

The EEG inverse is, in the general case, an underdetermined system. Some form of a-priori information must therefore be applied in order to 
choose among the multiple solutions, and various approaches have been developed. For example, in localizing very focal activities it may be 
appropriate to just a few equivalent dipole sources to the EEG. In such cases, the problem becomes over-determined, and the head model is 
“scanned” in some manner for the few source sites that best match the EEG. Distributed source models have also been studied. These algorithms 
pursue a solution for all potential source sites in toto. The number of sources is not typically limited beyond the selection of a general region, such as 
the cortical surface. With distributed source models the problem formulation remains underdetermined, and various strategies exist for choosing 
among the multiple solutions, often by minimizing a cost function that satisfies some criterion. A weighted minimum norm often plays a central role. 
Some approaches, such as sLORETA [Pascual-Marqui, 2002] may be characterized as having good localization, but low resolution, while others, 
such as FOCUSS [Gorodnitsky, 1995] are oriented towards high resolution but favor localized solutions and are dependent on initialization. 

In this paper we present a modification to FOCUSS that uses sLORETA for initialization and also applies “standardization” during the FOCUSS 
iterations. We also trim the candidate source space during FOCUSS iterations for computational efficiency. Simulation results with noisy EEG data 
are presented in a subject-specific head model using the Finite Element (FE) method for forward computations. 

METHODS 


The distributed source localization problem can be stated as: 

<f)=KJ ( 1 ) 

where O is an m x 1 vector of electric potentials from electrodes on the scalp, J is a 3n x 1 vector representing current sources at n locations in the 
brain, with 3 orthogonal components per location, and K is the lead-field matrix representing system transfer coefficients from each source to each 
measuring point. A unique solution to (1) can be achieved by zero-order Tikhonov-Philips regularization, as follows: 

j = K t [KK r + al] ' KJ = RJ ( 2 ) 

where a is the regularization parameter, which we choose according to the discrepancy principle [Hansen, 1998]. This choice for a balances the 
minimization of the norm and the residual such that the residual matches the expected noise level, which we assume can be estimated from a 
quiescent, or pre-stimulus, EEG recording. R is the resolution matrix, defined as: 

R = K T [KK r +al\' K (3) 

The resolution matrix describes a mapping from the actual source activity to the estimated activity. The basic idea of sLORETA is to normalize the 
estimated source power using the resolution matrix R, as follows: 

x rji 1 A 

jf (R«)~ J/ (4) 

/V 

where J z is a 3 x 1 vector containing the source estimate at the / location and R// is a 3 x 3 matrix containing the / diagonal block of the 
resolution matrix. Though the image produced by sLORETA is blurred, the peak is correctly centered at the location of the source. It has been 
suggested that an ideal initialization of FOCUSS would be a low resolution estimate, so we chose sLORETA as our initialization for FOCUSS. 
FOCUSS is a high-resolution algorithm that recursively implements the weighted minimum norm (WMN) method and converges to a sparse 
solution. In this study we investigate the efficacy of applying the standardization concept to FOCUSS as well. At each iteration, /, a regularized 
implementation of FOCUSS computes a WMN solution to (1) as follows: 

j. = WjWi T K T [KWjW^K 7 + al] ‘o (5) 


where w . is a diagonal 3 n x 3 n matrix which is recursively refined based upon the current density estimated by the previous step: 

W, = W ( ._j [Diag (J w (1), J w (2),..J(3«))] 



where the factor W M i s included only in the “compounded” version. The FOCUSS algorithm converges to a 
contribution from at most m sources. Inspired by the localization performance of sLORETA, we incorporated 
algorithm. Applying the definition of the resolution matrix in (1) to each FOCUSS iteration in (5), we obtain: 

R. = W ( W, 7 K 7 [k\V, W 4 K 1 +al\~'K 


localized solution with a non-zero 
standardization into the FOCUSS 
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and we normalize the power of the source estimate for each iteration of FOCUSS as in (4). Iterations are continued until the solution does not change 
in two consecutive steps or the solution becomes sparser than m. As it iterates, FOCUSS creates an increasingly sparse solution by forcing most 
sources to zero. We therefore remove small sources from the problem formulation at each iteration in order to reduce computation time. The 
definition of “small nodes” may be application-specific. In the example presented here, at each iteration we discarded nodes with source power less 
than 10% of the maximum, as long as the resulting source space would not fall below the number of measurements, m. This prevents the formulation 
from becoming over-determined, which can generate undesired solutions with a large residual. 

To evaluate this algorithm, simulations were carried out in a realistic head model based on classified magnetic resonance (MR) images of a 
human head. The model contained 11 tissue types at a resolution of 2 x 2 x 3.2 mm. The cortical surface was sampled with an average spacing of 4 
mm, producing 3035 candidate source locations. A 145-channel EEG lead-field for these cortical surface sites was calculated using the finite element 
method [Schimpf, 2002], resulting in a lead field matrix of dimension 145 x 9105. Common average referencing was used. 


RESULTS AND DISCUSSION 

We present results for 8 test sources, located as illustrated in Fig. 1. This configuration is 
challenging in that we attempt to recover localized details for two spatially disparate source clusters, 
with one of the clusters containing deep sources. An “L” shaped cluster of four sources is located at 
the base of the occipital cortex, and a linear cluster of four sources is located in the limbic association ioo 
area of the prefrontal cortex. The sources were dipolar with an orientation into the plane of Fig. 1. 

All sources were given the same magnitude. The EEG produced by these sources was simulated and 
white Gaussian noise was added for a total signal-to-noise ratio of 20 dB. The superimposed 
rectangle identifies the region illustrated in subsequent images. 

Fig. 2 illustrates the inverted source image using the “standardized” implementation of FOCUSS 200 
discussed above, initialized using sLORETA. Compounding of the weight matrix was not used. The 
results are shown in the plane of Fig. 1 along with the adjacent planes on either side. Darker sources 
represent higher amplitudes. Although there is some local leakage, both the location and shapes of 
the sources are represented. Some additional source leakage into other nearby planes occurred which 
are not illustrated here. A noise-free simulation of this source configuration was reconstructed 
perfectly by this algorithm. Fig. 3 illustrates the inverted source image using the FOCUSS algorithm initialized by the unbiased minimum norm 
[Gorodnitskly, 1995]. Compounding of the weight matrix was not used. Although the shape of the source cluster in the limbic association area is 
discernable, the occipital sources are not. 

Although we present only a single test case here, we have found that, in general, initializing FOCUSS with sLORETA and applying the 
standardization concept during FOCUSS iterations enhances the reconstruction of local cluster shapes. 
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Figure 1. Location of actual sources 
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Figure 2. Inverted sources using “standardized” FOCUSS, initialized by sLORETA. 
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Figure 3. Inverted sources using FOCUSS, initialized by the minimum norm 
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ABSTRACT 


We will present a new and fast regularization procedure for reconstructing current densities based on EEG and / or MEG measurements. It is our 
goal to achieve an additional stabilization of the solution which is superior to known procedures: our approach does not only consider spatial 
smoothness but also temporal smoothness of activation curves as a-priori information. 

KEY WORDS 

Current density reconstructions; linear regularization; Spatio temporal regularization; model adapted filtering; dynamic inverse problems. 

INTRODUCTION 

Current source density reconstruction algorithms are based on the linear relationship between a time series of measurement vectors TTL t and a 
time series of solution vectors J t , representing the strengths of a collection of predefined current dipoles, usually sampling the entire possible 

solution space. This relationship is mediated by a lead field matrix L . It is well known that the inverse problem of EEG and MEG suffers from 

_ • 

general non-uniqueness, i.e. there is always an entire class of possible solutions. This is due to the fact that the vectors J t have a higher dimension 

than the measurement vectors m t . If the solution space is not restricted beforehand by e.g. assuming a single focal generator, additional 

assumptions have to be used to get an unique solution of the problem, in most cases one uses the generalized inverse IL which avoids this 
shortcoming. However, the resulting linear estimators suffer from a bad condition, i.e. small perturbation in the measurement, e.g. by noise, lead to 
unusable solutions. This leads to the necessity of regularization. For example, the well-known Tikhonov-Philips based regularization procedure 
solves 


I, I H - 


m 


+ 


^S,|| R], 


—> min 


Here R is a regularization matrix, e.g. R = I or R = A. The parameter X is called regularization parameter. The second sum models a- 
priori information as spatial smoothness and stabilizes the solution in the presence of noise. In this formulation no temporal couplings occur. 

As activation curves of neural activity are smooth, we introduce temporal smoothness as further a-priori information about the current densities 
we are searching for. We thus extend the method above as follows: 


Z t\\ L J, ~ m tf + t\\ R J, If t\\j t+ i -Jt 


—> mm 


The last term measures temporal smoothness, minimizing the last sum favors current densities with smooth activation curves. This extension can 
be applied to other problems as dynamic computerized tomography and dynamic impedance tomography, see [Schmitt, 2002]. 

METHODS 

Trying to solve the second minimization task with the same methods used for solving known Tikhonov-Philips based minimization problems one 
encounters efficiency problems which lead to unacceptable solution times ranging from some days to weeks. But using the difference matrix 

D = {~O i . + S j+l j ) j j of size ( T — 1) X T , that is D has the value — 1 where j = i and + 1 in the case j = i + 1 , our problem is 
equivalent to solving the so-called Sylvester equation [Schmitt, 2002] 

(L T L + AR T R) X + /jXD t D = M 

which can be solved much faster. Solution times range from seconds to a few minutes on a standard desktop computer, depending on the size of 
the problem. Here the temporal measurements form the matrix M by column wise arranging. In the same way the solutions form the columns of 

X . For some special regularization operators, e.g. R = I , we can achieve further speedup solving a Sylvester equation involving LL [Schmitt, 

2002], which is in general much smaller than L L : if L is nx3N , where il is the number of measurement channels and N is the number of 

T T 

influence nodes, then LL is Tl X n and L L is (3 N) X (3 AO- 
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RESULTS 


For studying the general behavior of STR, we used a simple setup, see Fig. 1: The influence space is a 10x10 grid in two dimensions. Nine 
sensors are placed in a planar array above the grid with center (5.5, 5.5, 2). The center of the influence space is (5.5, 5.5, 0). We assume constant 
conductivity in the whole space, so the leadfield matrix can be computed analytically. Two equally oriented dipoles with moment (0,0,1) at x=3, 8 
both at y=5 are placed on the 10x10 grid.. Gaussian dipole-strength time series are assigned to each dipole, they are drawn in Fig. 2. We generated 

synthetic data and added 30% uniform noise. All methods use R = I as regularization matrix. We calculated temporal uncoupled Tikhonov-Phillips 
solutions and STR solutions. The first used temporal smoothed data (Savitzky-Golay filter of order three and length five). Based on these results, 
Figure 1 shows current density reconstructions, in Figure 2 reconstructed activation curves are drawn. The improved stability in presence of noise is 
apparent. 



Figure 1: Left: setup of the simulation . Middle: current densities calculated from temporal uncoupled Tikhonov-Phillips solution. Right: 

current densities calculated from STR reconstruction. 



Figure 2: Left: activation curves of the two dipoles used for generating synthetic data. Middle: activation curves based on temporal uncoupled 

Tikhonov-Phillips solutions. Right: activation curves calculated from STR reconstruction. 


DISCUSSION 

One important question is if we can achieve the same results by smoothing the data before using uncoupled linear methods. Regarding 
regularization procedures \ Y — > N(A ) 1 (Z X of an ill posed operator A \ X —> Y , one can define the filter F y AT y which acts on 

data. We can reconstruct the regularization T from by applying the Moore-Penrose inverse A + to the filter: 

A + F r =A + AT y =P N(A) 1 T r =T r . That is, one can switch from regularizations to filters and vice versa. These filters are not arbitrary, they 

are adopted to the underlying problem. For a deeper discussion see [Louis, 1999]. 

We computed the filter F STR which belongs to the STR procedure, explored the structure of this matrix and observed that a decomposition 

F str — (Ij> ® F 1 spatial \F tempora i ® I n ) is not possible in general, except for unrealistic cases as T=1 and n=l. Here ® denotes the Kronecker 

product. As a result, STR reconstructions cannot be obtained by applying known spatial regularizers to temporally smoothed data. STR interweaves 
spatial and temporal a-priori information to a new regularization method. 
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ABSTRACT 

An analytical method for solving the inverse problem in magnetocardiography is presented. The suggested approach uses the Fourier transform 
and eigen values and vectors method to define spatial location and magnetic moment components of an effective magnetic dipole, considered to be 
the source of the weak magnetic field produced by the human heart. The advantages of the presented approach are high accuracy and stability for 
magnetic field source localization. Utilization of the FFT algorithm assumes knowledge of the magnetic field distribution not only within the 
measurement grid, but also in the extended area, which is generally unknown. The goal of this research is to improve inverse solution algorithms 
performance using advanced interpolation to restore the missing data outside the measurement grid. Inaccuracy of localization also increases when 
the dipole position in the measurement plane is close to the measurement grid boundaries. To overcome this shortcoming preliminary dipole 
localization is used to estimate the source location in the x-y plane and then the magnetic field is interpolated in the extended area, which is centered 
relative to the center of the estimated dipole location. As a result we have obtained high accuracy localization of the magnetic field source, which 
does not exceed 7 percent from the theoretical dipole location even if the field extrema are not symmetrical to the center of the measurement grid. 
The results of the numerical solution are presented for the theoretical dipole and real MCG data during the repolarization phase of the heart. 

KEY WORDS 

Inverse solution, localization, magnetic dipole, data interpolation. 

INTRODUCTION 

Creation of magnetometric systems based on SQUIDs, possessing unique sensitivity and precision, has stimulated investigations in the area of 
weak magnetic field spatial analysis. Modern SQUID-gradiometers permit registration of weak magnetic fields generated by the human heart, brain 
and muscles. Interpretation of the measured data requires development of methods and algorithms to convert the information obtained to a form more 
suitable for further analysis. 

One of the most important problems of biomagnetic investigations is determination of 
the location (spatial coordinates) and magnetic moment of a field source. In this paper we 
present a new approach to the inverse problem, which gives a robust analytical and 
numerical solution of the effective magnetic dipole. 

The goal of this study is to define the most probable location and magnetic moment of 
an effective magnetic dipole, which is considered to be a source of the measured magnetic 
field. 

METHODS 

Let us assume the magnetic field is measured by second-order gradiometers in the 
standard 6x6 node grid above the human chest and distance between nodes is 4 cm (Figure 
1). The measured magnetic field is approximated by a source, which is represented by a 
point magnetic dipole of moment M, located at the origin of the “global” Cartesian system 
of coordinates XYZ. The position of the observation point relative to the dipole is denoted 

by R and N is a unit vector in the direction of R . 

The magnetic field component measured by each sensor is defined by the following 
expression: 

d 2z = B z (x,y,z 0 )-2B z (x,y,z 0 +l) + B z (x,y,z 0 +2l) (1) 

The scheme, represented in Figure 1, provides the magnetic field 
data within the standard 6x6 measurement grid. However the magnetic 
field is not equal to zero outside this grid and it can significantly effect 
the inverse solution results. It is known that the magnetic field is 
continuous within the whole space outside the volume of source and 
tends to zero as the distance to the source increases. 

The inverse solution method is based on the Fourier transform and 
eigen values and vectors method [Tikhonov, 1963]. The magnetic 
field, produced by the heart, nearly disappears at distances that are equal 

or greater to 1L X 1L , where L is the dimension of the standard 
20x20 cm measurement grid. We need to restore the data within this 

1L X 1L grid. 

First the measured data (Figure 2, left) are interpolated from the 
standard 6x6 grid to a new smothed grid (Figure 2, right) and global 
minumum and maximum coordinates are estimated. These extrema are used to calculate the preliminary dipole location, which is considered to be at 
the middle between found extrema. 




6 . p>cA s 


Figure 2. Magnetic dipole field values at the nodes of standard 6x6 
grid (left) and its interpolated values with 0.5 cm step (right). 



Figure 1. Coordinate system for the source and 
measurement grid. 
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Second, the extended 7L grid is constructed. The center of the extended grid matches with the measurement grid (Figure 3). The center of the 5L 
area, which is white on Figure 3, is positioned to the preliminary dipole, which was found. This 5L area will be used to calculate interpolated data 
using the measured data (dotted square in the center) and zero filled boundaries of the extended grid (gray area). 

The two-dimensional spline interpolation [Press, 1992] has been used to restore the data within the 5Lx5L area. Figure 4 (left) shows the 
extended data, generated by a magnetic dipole, placed in the center of the grid, using the formula (1), while Figure 4 (right) demonstrates results of 
the extension technique applied to the initially measured 20x20 cm area. The interpolated field matches well with the theoretical distribution. 


RESULTS 

Table 1 illustrates results of the numerical solution for the 
inverse problem using the analytical method. First magnetic field 
distribution was calculated from the theoretical magnetic dipole, 
located at x=140 mm, y=60 mm and z=180 mm relative to the 
upper left comer of the measurement plane (Figure 3, right). Then 
the effective magnetic dipole parameters were calculated using this 
field distribution at the measurement plane. The effective magnetic 
dipole dynamics have been used for normal/abnormal MCG map 
estimation (Figure 5). 



Nnm 

Ymm 

Z|nm 

m x 

m y 

m z 

Theoretical 

Dipole 

140 

60 

180 

0.707 

-0.707 

0.0 

Calculated 

Dipole 

142 

56 

183 

0.703 

-0.711 

-0.015 



Figure 3. An ideal dipole location (left) and shifted dipole postion 
(right). L is the measurement plane dimension, 5L is the extended grid 
dimension. The gray area is zero filled, the white area is interpolated. 


Table 1. Numerical solution results. 

DISCUSSION 

The analytical method and developed software can be applied for 
medical investigations of the human heart and brain. The presented 
approach has high accuracy in the magnetic dipole source 
localization as may be required for arrhythmic source localization in 
the heart or specialized catheter location and orientation. Optimized 
interpolation of the measured field gives stable dipole localization 
even if field extrema are not symmetrically located to the center of 
the measurement grid. Effective magnetic dipole localization results 
taken from the heart allow functional estimation of the dynamics of 
electrical instability of the cardiac muscle during the repolarization 
phase and differentiate normal and abnormal patterns for ischemic 
heart disease. 



Figure 4. Magnetic dipole field distribution at the extended grid, 
calculated from formula (1). 
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Figure 5. Effective magnetic dipole dynamics for normal subject (left) and abnormal 
(right) during repolarization phase. The map is shown for T wave end. 
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ABSTRACT 

This paper proposes a simple statistical method for extracting target source activities from spatio-temporal source activities 
reconstructed using MEG measurements. The method requires measurements in a control condition, which contains only non-target 
source activities. The method derives, at each pixel location, an empirical probability distribution of the non-target source activity 
using the time-course reconstruction obtained from the control period. The statistical threshold that can extract the target source 
activities is derived based on the empirical distributions obtained from all pixel locations. Here, the multiple comparison problem is 
taken into account by using two step procedure: standardizing these empirical distributions and deriving an empirical distribution 
of the maximum pseudo T value at each pixel location. The results of applying the proposed method to the auditory evoked 
measurements are presented to demonstrate the method’s effectiveness. 

KEY WORDS 


Nonparametric statistics, MEG, Source reconstruction, Statistical significance test 

INTRODUCTION 

This paper proposes a simple method for this statistical subtraction between task and control measurements; the method is 
applicable to results of spatio-temporal source reconstruction from MEG/EEG measurements. This method assumes neural activities 
to be quasi stochastic, and it uses nonparametric statistics to derive an empirical probability distribution of these activities using 
the time course reconstruction in the control period. This empirical distribution is then used for deriving the statistical threshold; 
the thresholding can extract the target activities that exist only in the task measurements by eliminating other non-target activities 
that exist both in the task and control measurements. One method for implementing such statistical subtraction has been recently 
proposed[Pantazis,2003]. The method uses the permutation tests to assess the statistical significance of the source reconstruction 
results, and thus it is very computer intensive. On the contrary, the method proposed in this paper does not use such computer 
intensive re-sampling methods as the permutation tests or the bootstrap, and it can be implemented with much shorter computational 
time. In this paper, we explain the proposed statistical thresholding method with spatial filter source reconstruction [Sekihara, 2004]. 
This is because the formulation of the spatial filter is relatively simple and the spatial filter techniques have been successfully applied 
to MEG source analysis [Robinson, 1998] [van Veen,1997]. However, the applicability of the proposed method is not limited to the 
spatial filter formulation and it can be used with any types of source estimation methods that can provide the spatio-temporal 
source reconstruction, i.e., that can reconstruct source time courses at all pixel locations. 

PROPOSED NONPARAMETRIC STATISTICAL SIGNIFICANCE EVALUATION 


We define the magnetic field measured by the rath detector coil at time t as 6 m (£), and a column vector b(t) = [bi (£), 62 (t),..., 6 m (£)] T 
as a set of measured data where M is the total number of sensor coils and superscript T indicates the matrix transpose. The spatial 
location is represented by a three-dimensional vector r: r = (x,y,z). The magnitude of the source moment is denoted s(r,t). 
Spatial filter techniques estimate the source current density by applying a simple linear operation to the measured data, i.e., 

M 

s(r,t) = w T (r)b(t) — ^ ic m (r)6 m (t), (1) 

m— 1 


where s(r,t) is the estimated source magnitude. The column vector w(r) expresses a set of the filter weights, which characterizes 
the property of the spatial filter. Various types of spatial filter techniques have been proposed and applied to the MEG/EEG source 
reconstruction problems. 

The measured data can be expressed as 

b(t) = bi(t) + b$(t) + n(£), (2) 

where b^(t) is the magnetic field generated from sources other than the signal sources, such as brain spontaneous activities or some 
evoked activities that are not the target of current investigation. This b^(t) is often referred to as the brain noise. The problem with 
the parametric modeling described in the preceding section is that it cannot efficiently take the brain noise into account, because 
the Gaussianity assumption does not hold for b^(t). Here, we propose a simple nonparametric method that can take such brain 
noise into consideration. The spatial filter outputs obtained from b(t) is expressed as 


s(r, t) = w 1 (r)bi(t) + w 1 (r)b^(t) + w T (r)n(t) = sj(r, t) + s c (r,£), 


( 3 ) 


where 

si(r,t) = m T (r)6/(£) and s c (r,t) = w T (r)(b^(t) + n(t)) (4) 

The key assumption in the proposed method is that the control measurement that can provide b c (t) — b%(t) + n(t) is available. 
Using this control measurement, the proposed method first derive an empirical distribution of s" c (r,£), and with this empirical 
distribution, the method determines the statistical threshold. The proposed procedures are described as follows: 

(i) Empirical distribution formation We calculate Sc(r,U), by applying the spatial filter to the control measurement 

b c (tj ) where tj is the discrete time point in the control measurement. We then calculate F(x), which is the empirical distribution 

of the modulus of the time course \s c (r,tj)\, such that F(x) — (t{|sc(V, tj)\ — x}/K c where t){|s' c (r, tj)\ < x} indicates the number 
of |sc(r,U)| which is less than or equal to x, and K c is the number of time points in the control measurement. This procedure is 

repeated and the empirical distribution is calculated at all pixel locations. Since F(x) is obtained at each pixel location r, F(pc) is 
rewritten as F(x\r) in the following explanation. 
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(ii) Statistical thresholding with multiple comparisons The proposed method uses the maximum statistics to incor¬ 
porate the multiple comparison problems; the use of the maximum statistics has been studied in [Pantazis,2003] [Nichols,2001]. To 
utilize the maximum statistics, we first studentize the empirical distribution of |s" c (r, tj)\ by calculating T(r,tj ) such that 


T(r, tj) = 


I s c {r,tj)\ - (|Sc(Mj)|), 
a(r) 


where a 2 (r) = { s c (r,tj) 2 ) c - (\s c {r,tj)\) 2 c , 


( 5 ) 


and (-)c indicates the time average over the control period, i.e., 


(3c(r , tj) 2 ) c 



kI 


K c 


3 = 1 


We then calculate the maximum T value T max (r) at each pixel location. The maximum T value at the 7th pixel location is denoted 
T^nax where i = 1,..., ifjv and Kn indicates the total number of pixels. We next obtain the empirical distribution of T^ acc , H(x), 
such that H(pc) = ^{Tmax < x}/Kn where (J{ T ^ aa; < x} is the number of T^ aa , values which is greater than or equal to x. We can 
then obtain the threshold for the a-confidence level, T ^ ax , such that T^ ax = i7 _1 ( 1 — a). The inverse of this empirical distribution 
can be calculated by first sorting T^ aa . in the increasing order 


7 ^( 1 ) rp{2) / / rp(K N ) 

max — -*■ max _ ‘ ‘ ‘ _ max 5 



and choose T^ax as T^ ax where p 
T(r), by using 


[(1 — ol)Kn\- We hnally obtain the statistical threshold for the spatial-hlter reconstruction, 


E(r) = T^ ax $(r) + (|s c (r, tj)\) c . (7) 

We evaluate the statistical significance of the spatial filter outputs by comparing the outputs |T(r, t)\ with T(r), and when |T(r, t)\ > 
T(r), the outputs s(r,t) is considered to be statistically significant. 


EXPERIMENTS 


We applied the proposed method to auditory-evoked MEG data to test its effectiveness. The auditory-evoked fields were measured 
using the 275-channel Omega-275 ™ whole-cortex biomagnetometer. The auditory stimulus (1-kHz pure tone) was presented to 
the subject’s right ear. A total of 400 epochs were measured for averaging. The eigenspace-projected adaptive spatial filter 
[Sekihara,2002] was applied to these averaged recordings. The snapshots of the source reconstruction results at 44 ms is shown in 
Fig. 1 (a). The reconstructed results not only contain a clear localized source in the left temporal lobe probably near the primary 
auditory area but also contain another diffused activity. The time-averaged reconstruction obtained over the whole prestimulus 
period is shown in Fig. 1(b). These results contain a diffused source similar to that found in the snapshot at 44 ms, suggesting that 
these diffused sources were caused by the periodic background activities. The results of applying the proposed method are shown 
in Fig. 1(c). The diffused source activities contained in (a) have been removed in (c),demonstrating the effectiveness of the the 
proposed statistical thresholding for removing the influence of background source activities. 





Figure 1: (a)Snapshot reconstruction at 44 ms. (b)Prestimulus-average reconstruction. (c)Results from proposed statistical 
thresholding. 
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ABSTRACT 

Knowledge of tissue conductivities is needed to construct reliable volume conductor models of the human body and the head in solving forward 
and inverse bioelectric field problems. In this study three different estimation algorithms are utilized to estimate human head tissue conductivity 
distribution in vivo. The applied algorithms are conventional Least-Squared Error Algorithm (LSEE), Bayesian maximum a-posteriori (MAP) 
Algorithm and statistically constrained minimum mean squared error estimator (MiMSEE). The algorithms intake a priori information on body 
geometry, statistical properties of regional conductivities, linearization error and instrumentation noise. The EEG-MEG data set has been obtained 
from a source localization experiment in which the median nerve has been stimulated. The anatomical boundary information has been extracted from 
256 T1-weighted MRI images. The MEG data have been obtained by using a 31-channel magnetometer over the somatosensory cortex. By using the 
data, scalp, skull and brain conductivities have been estimated and estimation variances are calculated by using the algorithms. It is shown that the 
MiMSEE algorithm has given the lowest error rates. The calculated error rates are 90% for the LSEE, 20.5% for the Bayesian Map estimator and 
12.5% for the MiMSEE. 
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INTRODUCTION 

There have been many studies on determining the biomagnetic properties of the human body, measuring the resistivities of the tissues and 
observation of the physiological activities. In these studies, electrical and magnetic field distributions are constructed by an electrical stimulus. 
Tissue conductivities are one of the factors that alter the field distributions. 

Knowledge of tissue resistivities is needed to construct reliable volume conductor models that are used in many areas such as forward/inverse 
problem solutions, source reconstruction in functional brain imaging [Awada, 1998], calculation and experimental estimation of electromagnetic field 
energy caused by the use of RF energy source [Okano, 2000], spatial temperature estimation of subsurface skin compartments under hyperthermia 
treatment [Paulsen, 1997], detection of lesions [Netz, 1993]. A volume conductor is a three dimensional (3D) body in which an electric field can be 
produced by an electrical stimulus. In this study, we aim at the estimation of the conductivity distribution from combined bioelectric (EEG) and 
biomagnetic (MEG) measurements, non-invasively. This is believed to yield more accurate forward and inverse problem solutions, which improves 
source localization accuracy. 

THEORY 


In a purely resistive, source free volume conductor, under quasi-static conditions, the relation between the electric potential distribution ^resulting 
from a low frequency electric field and the conductivity distribution <j(x,y,z) is given as, 


a.V 2 (/) + Va.V</> = 0 


( 1 ) 


If cr(x,y,z) and the boundary conditions on the surface are provided, the electric potential distribution can be determined uniquely. This is known as 
the forward problem and c|)(cr) is nonlinear. 

The inverse problem is ill-conditioned so one may reduce the problem into finite dimensions and linearize it when regional resistivities have their 
statistical mean values. Then the electrode potentials can be expressed in the following form: 

v' =M<T + Tj 0 +q n + j n (2.1) 

v = v ~bo ( 2 - 2 ) 

v = M<y + q, 1 +j n (2.3) 

Here, v' is the measurement vector, a is the unknown conductivity vector and tj 0 is the offset vector representing the error between true and 

estimated measurements when the resistivities of the regions are equal to their mean values. The offset can be separated and v (linearization offset 
free measurements) can be expressed as a linear function of a (the unknown regional conductivity vector). In the final form, M is the forward transfer 
matrix, q ^ is the error resulting from linearization and j ^ is the instrumentation noise. The j ^ is inherent to the measurements and rj 0 and q ^ are 

created by the linearization process. The matrix M possesses information about geometry, which incorporates surface information and 
electrode/sensor positions. The calculation of these terms requires successive solutions of the forward problem for different conductivity 
distributions. Calculation of these matrices are found in an earlier study [Baysal and Eyiiboglu 1999]. 

The Minimum Mean Squared Error Estimator (MiMSEE) algorithm can be expressed as follows [Lewis 1971, Baysal and Eyiiboglu 1999]: 

(7 = Bv + b (3.1) 

B = SM t (MSM t + Q n + J n )-‘ (3.2) 

b = (cr) - BM (cr) (3.3) 

where (•) is the expected value operator; S, Qr| and Jr| are covariance matrices of regional resistivities, quantization noise and instrumentation noise, 
respectively. The estimation error covariance matrix can be calculated as: 


-l 


-i \ -l 


C e = (M C M + C ) 


(4) 
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In the Bayesian MAP estimator, the parameter vector is assumed to have a statistical distribution of (CJ ~N( cr, C a )), and also lets assume that the 

noise is Gaussian with zero mean and known variance C n (n ~ N(0, C n )). Under these assumptions the Bayesian MAP estimation can be obtained by 
the following formula [Kay, 1993]: 

<J = (M T c; 1 M + c-J ) - 1 (M T c; 1 V+c; 1 a) (5) 

Estimation error covariance matrix can be calculated as: 

C e = (M T C n ~ 1 M + C a ~'y 1 . (6) 

The conventional Least Square Error Estimator and its error covariance matrix for the linearized equation given in equation (2.3) can be given as 
follows [Lewis, 1971]: 


6 = (M‘ M)~ l M t v 

c e =(m t m y l 


( 7 ) 

( 8 ) 


MEASUREMENTS AND RESULTS 


The measurement data have been obtained during SEF/SEP experiment involving electrical median nerve stimulation of one subject. The estimation 
has been realized for the three algorithms stated in the previous section and tabulated in the Table-1. The measured data (v) are the raw fusion of 
EEG and MEG measurements and estimates of the conductivities are computed according to equations (3.x,5,7). For comparison, the standard 
deviations of the estimations are calculated by using diagonal elements of the matrices calculated in the equations (4,6,8); and the error rates are 
calculated by the equation (9). 


error 


rate = 1 OOx 




Table - 1: The results of the algorithms by using in vivo EEG-MEG data. 



Conductivity 

(S/m) 

Standard deviation 

Error Rate 

(%) 

LSEE 

MAP 

MiMSEE 

LSEE 

MAP 

MiMSEE 

LSEE 

MAP 

MiMSEE 

Scalp 

0.6230 

0.2898 

0.3130 

0.5641 

0.0595 

0.0391 

90.6 

20.5 

12.5 

Skull 

-1.3001 

0.0187 

0.0150 

0.1210 

0.0052 

0.0021 

-9.3 

27.8 

14.0 

Brain 

-9.5103 

0.2788 

0.3631 

0.9514 

0.0580 

0.0741 

-10.0 

20.8 

20.4 


CONCLUSIONS 

In this work, human head tissue conductivities are estimated in vivo by using three different algorithms. The conventional LSEE yields 
unrealistic negative conductivity values for skull and brain. The most appropriate algorithm for the problem considered in this work is the MiMSEE 
algorithm. It estimates scalp conductivity by an error rate of 12.5%, skull conductivity by 14% and brain conductivity by 20%. Bayesian MAP 
estimation yields 20% error rates on the average. Because of that, MiMSEE yields better results than the other estimation algorithms for the given 
problem. We aim to repeat the same study on more subjects. 
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A Two-objective Hybrid Algorithm for Source Localization in MEG 
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ABSTRACT 

We describe an optimization method to localize multiple simultaneous sources in Magnetoencephalography (MEG) based on a genetic algorithm 
(GA) of the multi-object type combined with a local search method, lending some weight to the proximity of one of the sources to a given deep brain 
structure. We have adopted a conventional GA though some improvements have been made in methodological decisions and parametric choices to 
improve its performance. To evaluate the performance of our algorithm, we constructed simple noisy and noiseless simulation examples of equivalent 
current dipoles (ECD’s) in a spherical head model. In the worst case (number of dipoles=4, NSR=10%) the percent of success is 100% and time 
consuming is about 30 minutes, 50% more than in a noiseless case for 4 sources. GA has shown to be robust and able to escape from local minima, 
finally converging to the global solution. The addition of local search makes the performance of the algorithm faster and more accurate 

KEYWORDS: Magnetoencephalography, Inverse Problem, Genetic Algorithm, Global Optimization. 

INTRODUCTION 

MEG (Hamalainen et al. 1993) has demonstrated its efficiency in dealing with superficial brain sources modeled as electric current dipoles, 
provided that the regions of activation are relatively focused. The dipole location, orientation, and moment parameters are determined by fitting the 
measured data with a non-linear minimization procedure. Due to the existence of many local minima and the properties of various minimization 
techniques, minimization is usually very sensitive to the initial guesses when the number of modeled dipoles is grater than one (Supek and Aine, 
1993). Manually selecting initial guesses is a time-consuming procedure and if a number of different starting parameters are not attempted, the 
calculation may be trapped in the local minima. Therefore it is necessary to find an automated procedure to effectively handle the multiplicity of local 
minima. In this work we show that the hybrid algorithm can effectively approach the global minimum with no need of pre-selected initial guesses. 

Besides the consideration of cortical activity mentioned above, the study of sources in deep areas, such as the thalamus, is of great interest. These 
sources are difficult to detect using standard methods, due to extreme weakness of neuromagnetic fields (~ 100 fT) and the fact that sensitivity of 
MEG declines rapidly with increasing source depth. Two factors are responsible for this loss of sensitivity. First, the amplitude of the measured MEG 
signal is inversely proportional to the squared distance between sensors and source. Second, the MEG amplitude rapidly decreases when the source 
approaches the centre of the head, due to the spatial distribution of the secondary current. In addition, the existence of simultaneous sources in most 
cases makes more difficult the exact localization of the sources. However has been shown that activity in deep brain structures, e.g., brainstem or 
thalamus, is indeed experimentally accessible by MEG. 

Genetic algorithms are able of searching for global optimum in functions which cause difficulty for gradient based methods. They have been 
successfully applied to many non-linear, multi-peak, continuous or combinatorial optimization problems. Principal advantages of GA are domain 
independence, non linearity and robustness (Goldberg, 1992). The only requirement for GA is to calculate some objective function, which may be 
highly complicated and non-linear. GA has been previously applied to the problem of MEG source localization with limited scope and good results 
(Jiang T, 2003). An additional advantage that makes GA a very promising technique for MEG source location is its flexibility. It can easily 
incorporate other objectives of the optimization by appropriate extension of the fitness function (Li et al., 2000), in order to weight certain 
characteristics of the source, such as its proximity to a given brain region. 

METHODS 

For the sake of simplicity we assume the sources modeled as ECD’s in a spherical and homogeneous conductor. In this case the magnetic field B at a 
point r due to a single dipole Q located at r 0 is given by (Sarvas, 1987): 

B(r) = T° 2 E Q x r 0 - (Q x r 0 -r )Vf] 

4 TIP 

r\ 

where F = a(r • a + r - r 0 • r ) , and a = r - r 0 

For the inverse probleme a conventional GA is adopted with some modifications. Each 
source is expressed by its genotype as a string of binary bits that represent the position and 
the components of dipolar moment (phenotype). In our case we have used 51-bit strings to 
represent each source. 

After the phenotype is determined we evaluate the fitness function. We have used two 
variants for this function. The first one has the following form F= f(e), where e is the 
normalized least square fit error and f a monotonic continuous function that maps the 
margin of error into the interval (0, 1). In the second one we consider a two-objective fitness 
function taking into consideration both f(e) and the distance to a zone of interest, f(d). 

F = k f(e) + (1-k) f(d) 0 <k< 1 

In case k= 1, we are only using the root mean square error in our fitness function, in other 
cases we are giving some weight to the RMS error and the complementary weight to the 
distance to some specific target areas of neural activity. 

Conventional GA are very expensive in terms of computational cost, thus we have use 
tournament and elitist preserving selection to accelerate the convergence. Elite set is 
variable from 0% to 10 % of the whole population, at the beginning of the process and at 

the end, respectively. Fig. Hybrid Algorithm Method. 
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In addition we use a downhill simplex method (Nelder and Mead, 1965) for minimizing the cost function, because of the poor performance of GA 
in local searching, to enhance its effectiveness. We have adapted the local search procedure depending on the fitness function of the source, giving 
the possibility of searching far if the fitness function of the source has a low value, and near if the fitness function is good enough. Besides we only 
use local search every certain number of generations and if the cost function has a good enough value. 

RESULTS 

To evaluate the performance of our algorithm, we constructed simple both noisy and noiseless simulation examples of dipole sources in a 
spherical model of head using Sarvas’s equation. The number of sources varies from 1 to 4. Three different cases have been considered for the noise 
level, corresponding to 0%, 5% and 10% relative to the signal. Each case has been performed 50 times using a PC Pentium IV at 2.4 GHz. The 
average results are shown in table 1. For one source we only consider the one-objective function. In two, three and four sources cases, the 
configuration always includes a deep source (5 cm. depth). 

The size of initial population has been adjusted for every case to the minimum size to obtain almost 100% of success with the least computation 
time. We consider a “success” when the distance between the real data and the solution provided by the algorithm is less than 1 mm. For simple 
Genetic Algorithm (SGA) and four sources success is lower. 


Noise level 

SGA 

Hybrid GA 

0 % 

Pop. size 

100 

5 

Time 

11” 

0.3” 

5 % 

Pop. size 

150 

5 

Time 

18” 

0.5” 

10 % 

Pop. size 

150 

5 

Time 

23” 

1.2” 


(a) (b) 


Noise level 

SGA 

Hybrid GA 

Two-objective 
Hybrid GA 

0 % 

Pop. size 

400 

50 

50 

Time 

10’ 47” 

24” 

26” 

5 % 

Pop. size 

500 

60 

60 

Time 

12’ 26” 

V 12” 

V 20” 

10 % 

Pop. size 

700 

60 

60 

Time 

23’ 54” 

V 27” 

1’35” 


Noise level 

SGA 

Hybrid GA 

Two-objective 
Hybrid GA 

0 % 

Pop. size 

2000 

250 

250 

Time 

3h 10’54” 

27’ 45” 

20’36” 

5 % 

Pop. size 

2500 

300 

300 

Time 

4h 5” 34” 

31’ 27” 

22’ 44” 

10 % 

Pop. size 

2800 

350 

350 

Time 

4h. 30’ 23” 

30’52” 

27’ 46” 


Noise level 

SGA 

Hybrid GA 

Two-objective 
Hybrid GA 

0 % 

Pop. size 

700 

100 

100 

Time 

1h 32’ 45” 

3’ 47” 

2’ 26” 

5 % 

Pop. size 

1000 

130 

130 

Time 

lh 55’ 9” 

4’ 18” 

2’ 43” 

10 % 

Pop. size 

1000 

140 

140 

Time 

2h 50’ 11” 

4’ 20” 

2’ 40” 


(c) (d) 

Table 1 . Results for one (a), two (b), three (c) and four (d) sources. Cases (b), (c) and (d) have one deep source and the rest are cortical. We are using 
k= 0.7 for the two objective function. 


DISCUSSION 

We have introduced a hybrid algorithm, (GA with local search techniques) in order to overcome inconveniences of the conventional GA. We have 
shown that the addition of local search makes the algorithm faster and more accurate. The implementation of a two-objective function makes it a 
good procedure to localize deep sources when there are other cortical sources present, especially when three or four sources are simultaneously 
active. For the two sources , the use of the one-objective or the two-objective function is not relevant the smaller number of iterations (generations) 
needed is compensated by an equivalent increase of computation time. However for the three and four sources the two-objective hybrid GA improves 
the efficiency in terms of global time calculation. 
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How many channels are needed for MEG? 
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ABSTRACT 

Channel count in modem MEG systems has been steadily increasing, but are more channels necessary? Assuming that the spatial sampling 
considerations are satisfied, this question can be answered by examining the MEG system’s ability to localize and resolve brain sources. For the 
simple situation where only uncorrelated sensor noise is present, dipole localization accuracy monotonically increases with increasing number of 
channels, while for correlated brain noise the accuracy increases only until the number of channels reaches 100 to 200. Beyond this limit the inter¬ 
channel separation is comparable to the brain noise correlation distance and increasing the channel count does not help. Contrary to the above dipole 
result, we show by simulations with up to several thousand channels, that if the data is analyzed by beamformers even in the presence of correlated 
brain noise, the two-source resolvability and single-source localization accuracy monotonically improve with increasing number of channels. We 
demonstrate such behavior for a 275 channels system, where we have inserted an artificial dipole into real measured brain noise and resampled the 
number of channels to 138. Beamformer analysis of the data shows markedly improved localization accuracy when the number of channels is 
increased from 138 to 275. This finding also signifies that the beamformer performance is not limited by system imperfections when the number of 
channels is as large as 275. To clarify these results, we illustrate analytically the mechanism of beamformer resolution dependence on the number of 
channels using an example of a simple system containing two dipole sources and uncorrelated sensor noise. 

KEY WORDS 

Magnetoencephalograpy, MEG, channel density, beamformers, synthetic aperture magnetometry, SAM 

INTRODUCTION 

Answer to the question about how many channels are needed for MEG ultimately depends 
on what we want to measure, character of the noise present in the system, and on how we want 
to analyze the data. We first examine the effect of noise on the MEG dipole localization 
accuracy and review spatial sampling requirements. Then we shall describe simulations of the 
beamformer behavior as a function of the number of channels and demonstrate the results on 
actual 275 channel MEG measurement. We conclude the discussion by a simple analytical 
example which provides insight into the beamformer behavior when the number of channels is 
changed. 

METHODS AND RESULTS 

For uncorrelated sensor white noise, the required number of channels is determined by 
spatial sampling arguments [Ahonen, 1993] and by dipole localization accuracy requirements 
[Vrba, 2002]. Sampling considerations suggest that the number of channels, M, should be large 
for superficial sources and for high signal-to-noise ratio (SNR), while localization error 
reduces with increasing number of channels (as 1 Hm for a single equivalent current dipole 
(ECD)). 

The brain noise is correlated among channels and the above simple arguments do not 
apply. The ECD localization errors derived by Monte-Carlo simulations of brain noise are 
shown in Figure 1. It was found that over a wide range of ECD positions, sensor types, and 
sensor shell radii, and for the number of channels M > 100, the standard deviation of the 
localized ECD positions becomes independent of the number of channels [Vrba, 2002] and can 
be approximated by a « P c v rms / Sp ea k •> where o is rms value of the standard deviations of 
the localization accuracy along three orthogonal directions, v rms is the rms value of the brain 
noise, S peak is the peak signal corresponding to the investigated ECD, and the constant of 
proportionality ~ 2.6 cm. The saturation of the ECD localization error for large number of 
channels corresponds to the channel density at which the inter-channel spacing becomes 
comparable or shorter than the brain noise correlation distance (about 3 cm). 

When the MEG signals are analyzed by beamformers (SAM) [Robinson, 1999], it is 
shown that significantly larger number of channels may be beneficial than that indicated in 
Figure 1. The SAM Z 2 peak width and resolution of two sources in simulated brain noise is 
shown as a function of the number of channels in Figure 2 (Z = PIN, where P is the power 
and N is the sensor noise projected by SAM). The peak width monotonically decreases 
(indicating monotonically improving resolution) and the saturation has not been reached up to the maximum computed number of channels of 2000. 
Similarly, resolution of two sources also monotonically improves up to the largest considered number of channels. The resolution of two ECDs is 
poorest when they are parallel, it dramatically improves when the angle between them increases, and is best for perpendicular ECDs. 

Simulations in Figure 2 were performed assuming a perfect MEG system. At a certain level of imperfections, e.g., inaccurate relative sensor gains 
or inaccurately known sensor positions, the real MEG performance will become worse than the SAM simulations. We have determined that this limit 
has not yet been reached by the existing CTF 275 MEG systems. We have measured brain noise and added a synthetic source to it. We have then, 
roughly uniformly, spatially resampled the 275 channel system to 138 channels. Using the original and resampled data, we have constructed SAM Z 2 
volumetric image of the synthetic source and constructed 3D contours of full-width-half-maximum (FWHM) of the Z peak. The contours for 275 
and resampled 138 channel systems are shown in Figure.3. 



Figure 1. Standard deviation of the ECD 
localization error normalized by rms noise and 
peak signal in the presence of brain noise 
simulated by randomly oriented and randomly 
distributed 1000 ECDs throughout the model 
sphere volume. Hemispherical sensor shell with 
radius of 10.7 cm, dipole magnitude q = 10 
nA.m, dipoles at distances of a = 5 and 7 cm 
from the model sphere center. The sensor types 
and the corresponding brain noise amplitudes 
were: radial magnetometers, 30.4 fT rms; 
tangential magnetometer, 17.2 fT rms; radial 
gradiometers with 5 cm baseline, 24.2 fT rms; 
radial gradiometers with 8 cm baseline, 27.1 fT 
rms; planar gradiometers, 11.5 fT rms. 
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Reduction of the number of channels from 275 to 138 
degrades the spatial resolution and dramatically increases the 
dimension of the 3D contours in Figure 3. The 275 channel 3D 
contours are reasonably well centered on the true dipole 
position, however, when the number of channels is reduced to 
138, the center of gravity of the contours moves away from the 
true dipole position. 

The SAM spatial resolution improvement by increasing the 
number of channels can be illustrated for a simple model of 
target, qi(t), and interferer, q 2 (t), in a model sphere when only 
uncorrelated sensor noise is present. When the SAM is 
positioned on the target, the synthetic channel output, s(t), 
(without the noise term) is 


s(t)« qi{t)+ q 2 {t) 


B 


cos A 


B 


M ■ SNR 2 \1 - cos 2 Z 


(i) 


where and B 2 are the target and interferer forward solution 
vectors, Z is the angle between them, and SNR 2 is the interferer 


SNR given by SNR 2 = q 2 


B 


2 /Mv 2 

/ ±V± V rms 


[Vrba, 2002]. Eq.l 


illustrates that the interference is inversely proportional to M and 
the SAM resolution is expected to improve with increasing M 

DISCUSSION 

Spatial sampling considerations and dipole localization 
accuracy lead to the requirement that the number of channels 
should be in the range from about one to several hundreds. 
However, beamformer simulations indicate that even in the 
presence of brain noise, the MEG systems can benefit from 
significantly larger number of channels. We have tested this 
finding by spatially resampling 275 channel MEG system. The 
result is consistent with the simulation predictions and indicates 
that the 275 channel MEG system is adequately accurate such 
that the limits of beamformer performance are not reached and 
the increase of the number of channels is expected to increase the 



E 

o 



0.02 

0.01 


100 200 500 

Number of channels, M 

Figure 2. Peak width and two source resolution for SAM beamformer in the 
presence of brain noise simulated by 1000 dipoles randomly distributed 
between the radii of 6 and 8 cm and with random orientation and random 
amplitude uniformly distributed in the range from 0.5 to 3 nA.m. Sensor shell 
is a spherical segment extending from vertex to 3xc/4 and with radius =11.2 
cm. Sensors were radial gradiometers with 5 cm baseline, roughly uniformly 
distributed over the sensor shell. The considered number of channels was 72, 
100, 140, 162, 200, 250, 390, 698, 1212, and 2025 (with corresponding inter¬ 
channel separations of 4.80, 3.92, 3.19, 3.08, 2.73, 2.40, 1.94, 1.46, 1.10, and 
0.84 cm); sensor white noise level was 5 fT rms/VHz. (a) Single source was 
added to the simulated brain noise. Depth profile through the source was 
computed by SAM Z 2 and FWHM of the peak is plotted as a function of the 
number of channels for dipole magnitudes of 3, 5, and 7 nA.m; (b) Two 
dipoles were positioned symmetrically about the sensor shell axis on an arc 
with radius a = 9 cm. The dipoles were parallel, equal amplitude q = 10 nA.m, 
and inclined at 0, 10, 20 deg, or perpendicular. SAM was scanned along the 
arc on which the dipoles are located and the dipole separation, d, at which 
they were resolved was computed and is plotted as a function of the number 
of channels. Dipoles were considered resolved if the SAM scan along the arc 
through the two dipoles exhibited a dip between the dipoles. Gray region 
corresponds to all dipole orientations. 

spatial resolution. 



8 10 
X2 (mm) 


8 10 12 
Xi (mm) 


8 10 

(mm) 

Figure 3. Effect of spatial resampling on SAM beamformer spatial resolution, (a) Brain noise was measured in unshielded environment by CTF 275 
MEG system (VSM MedTech Ltd., Coquitlam, BC, Canada). Signal corresponding to an ECD with 20 nA.m magnitude and positioned about 4.5 cm 
below the sensor array was added to the measured brain noise. Dipole orientation was parallel to the prevailing local brain noise source orientation. 
SAM beamformer was used to compute volumetric Z 2 image (ratio of projected power to the projected sensor noise), and 3D contours of FWHM 
were constructed. Then the 275 channel sensor array was roughly uniformly resampled to 138 channels and the same SAM Z 2 image was constructed 
using the identical data, (b, c, d) 3D contours of SAM Z 2 images corresponding to the original 275 channel and the resampled 138 channel sensor 
arrays. White ‘+’ indicates the true source position, dark region corresponds to the 275 channel 3D contours, and the light gray region to the 138 
channel 3D contour; (b) jci-jc 3 view of the 3D contours; (c) x 2 -x 3 view; (d) x\-x 2 view. 
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ABSTRACT 

We propose a number of current source models that are spatially distributed on a line or surface using magnetoencephalography (MEG). We develop such 
models with increasing degrees of freedom, derive forward solutions, maximum likelihood (ML) estimates, and Cramer-Rao bound (CRB) expressions for the 
unknown source parameters. We compare the proposed line- and surface-source models with existing focal source models, and show their usefulness for certain 
biomagnetic experiments. We apply our line-source models to N20 response after electric stimulation of the median nerve known to be an extended source and 
our surface-source models to KC1 (potassium-chloride) induced spreading depression. 

KEYWORDS 

Magnetoencephalography, Extended source modeling, N20 responses. 

INTRODUCTION 

We propose several line- and surface-source dipole models for MEG and investigate their aspects. For each of the proposed models, we give the forward 
solution, derive the maximum likelihood (ML) estimates and Cramer-Rao bounds (CRBs) for the unknown parameters. We also discuss the special case of 
the spherical head model and radial sensors which results in more efficient calculations. We apply our models to N20 responses and KC1 induced spreading 
depression. 

METHODS 

Let b{r,t) be the magnetic field induced by a focal electrical source j(r,t) = q(t)5(r — p) using a realistic head model (Muravchik [2001]), where q(t) 
denotes the dipole moment, p the source position, r the position, and t the time. Assuming ra MEG sensors the measurement model can be written as 
y{t ) = A(O p )q(t) + e(£), where y{t) is a vector of dimension rax 1 of the measured magnetic fields, e{t) is additive noise, Op = [p, cf),p\ is the vector 
of source position parameters with p denoting the azimuth, 0 the elevation p the distance from the origin, and A(6 P ) is a gain matrix of dimension rax 3. 
Considering this model with K independent trials (e.g. evoked responses), N temporal samples, and assuming the source position is fixed in time, the ML 
estimate of 0 P is Q p = argmin^ ~y(t) T P(0p)y(t)\ , where y(t) = (1 /K) 7/ fc (t), Vk(t) denotes the measurement vector for the kth trial, 

and P(0p) = A(0p)[A(0 p ) T A(0 P )] -1 A(0 P ) T . The ML estimate of q(t) can be calculated using the Moore-Penrose pseudoinverse (Dogandzic [2000]) as 

a rn a -i a f-p 

q(t) = [A(Qp) A(Qp)] ~ A{0p) y(t). We derive the CRB which is a lower bound on the covariance of any unbiased estimator, and is asymptotically achieved 
by the ML estimator. It is an important performance measure that can be used to evaluate the statistical efficiency of estimation algorithms, to determine the 
main regions where good and poor estimates are expected, and to optimize the sensor system design. See (Yetik [2004]) for details for the line-source case. 
Line-Source Models: We give the source currents for the three line-source models that we propose: variable-azimuth constant moment (VACM), variable- 
position constant moment (VPCM), and variable-position variable-moment (VPVM). 

VACM: The source current is q{t)5{p — p 0 )5(cf) — cj) 0 )[u(p — pi) — u(p — p 2 )} with q(t) = [q x (t), q y (t ), q z (t)] T , where 0o is the fixed elevation of the 
source, pi and p 2 the limits of the azimuth extent of the source, and u(-) the unit step function. 

VPCM: The source position is [p x (s),p y (s),p z (s)] T = WA*(s) resulting in the source current q(t) for r = [p x (so),p y (so),p z (so)], so e [si,S 2 ] and 0 
otherwise, where W is the matrix of unknown coefficients that determine the position of the source, \P(s) is the set of known basis functions, p Xj y jZ (s) denotes 
the position of the source in cartesian coordinates, and si and S 2 determine the extent of the source. 

VPVM: The source moment density varies with position resulting in the source current q(s , t) = X(t)~/(s) for r = [p x (s),p y (s),p z (s )], s e [si, S 2 ] and 0 
otherwise, where X is the matrix of unknown coefficients that determine the spatial variation of the dipole moment and 7 (s) is the set of known basis functions. 
Surface-Source Models: We present the source currents for the three surface-source models that we propose: constant-radius constant moment (CRCM), 
variable-position constant moment (VPCM), and variable-position variable-moment (VPVM). 

CRCM: The source current is q(t)S(p — p 0 )[u(cj) — 0i) — u(0 — <^ 2 )] [u(p — pi) — u(p — p 2 )}, with q(t) = [q x (t ), q y (£), q z (t )] T • 

VPCM: The source position is [ p x (s a , s b ),p y (s a , s b ),p z (s a , s b )] T = s b ), (s a , s b ) e [s a i, s a 2 ] IT [s&i, s b2 ], resulting in the source current q(t) for 

r = [p x (s ai s b ),p y (s a , S b ),p z (s a , s b )}, (s a , s b ) e [s a i, s a2 ] H [s bl ,s b2 ] and 0 otherwise. 

VPVM: The source moment density varies over the surface resulting in the source current q(s ai s bl t) = X(t)j(s a , s b ) 

for r = [p x (sai s b ),p y (sa, s b ),p z (sai s?,)], (s a , s b ) e [s a i, s a 2 ] fl [sbi, <^ 2 ] and 0 otherwise. Using these source currents the magnetic field can be calculated 
by integrating the magnetic field induced by a dipole. This integral will be a line integral for the line-source models and surface integral for the surface-source 
models, the details can be found in (Yetik [2004]). 

Spherical Head Model: The special case of a spherical head and radial sensors results in more compact forms of induced magnetic field involving elliptic 
integrals (Byrd [1954]). See (Yetik [2004]) for more details for the line-source case. 

Low-rank Gain Matrices: The radial components of the dipole sources do not produce magnetic fields outside a spherical head. Therefore, the gain matrix 
for the focal source model has a rank equal to two when a spherically symmetric head model is used. A similar situation exists for the proposed line- and 
surface-source models under certain conditions. These conditions are derived in (Yetik [2004]) for the line-source case. 

RESULTS 

Distinguishing Between Line-Source and Focal Source Models: We first investigate when it is possible to distinguish between a line source and a focal source 
using the Neyman-Pearson hypothesis test. A collection of ROC’s for different source lengths is given in Figure la using VAVM. A similar plot is given for 
VPVM in Figure lb. Selecting Lb = 0.9 and Pf = 0.1 as the boundary of a confident decision, the minimum source length which is distinguishable from a 
focal source is 1.81cm for VAVM, and 1.69cm for VPVM. 

Application to N20 response: We applied the VPVM model to real data of N20 response after electric stimulation of the median nerve. The resulting N20 
generator is known to be an extended source along the wall of the central sulcus, which is mainly one-dimensional and a good example where line-source models 

Uhe work of I. S. Yetik and A. Nehorai was supported by the National Science Foundation Grants CCR-0105334 and CCR-0330342. The work of Carlos H. Muravchik was 
supported by CIC-PBA, UNLP and ANPCTIP of Argentina. 
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Figure 1: Parts (a)-(b): Receiver operating characteristics for different source lengths (denoted by A) for VAVM (part (a)), VACM (part(b)) and focal source 
models. Part (c): Estimated source positions using the focal source model and VPVM. The symbol “+” shows the estimated position of the focal source, and 
the line shows the estimated position and extent of the line-source. 



Figure 2: Spreading depression on the rat brain. Part (a) is for t = lOsec after spreading depression started, (b) 250 sec, (c) 500 sec, and (d) 750 sec. 


are needed (Nolte [2000]). The experiment was performed in eight healthy volunteers. The MEG data were recorded over the contralateral somatosensory 
cortex with a 31 channel biomagnetometer (Philips, Hamburg, Germany). Constant current of 0.2 ms square-wave pulses were delivered to the right or left wrist 
at a stimulation rate of 4 Hz. We have used realistic head models when applying our models to N20 data. The estimated source positions (for the focal source 
and VPVM models) are plotted in Figure lc for one of the subjects. We also compare the source extents (path length of the source curve) and mean-squared 
errors for all eight subjects in Table 1. 

Estimating source extents using different noise realizations: We conducted another simulation to show that the estimated source extents are the real extents of 
the sources rather than a misleading result of the brain noise. We considered a straight line-source 'ip(s) = [1, s] T (which is close to the estimated line-source of 
one of the subjects), and calculated the exact induced magnetic field intensity using this source. We then added different noise realizations using the real brain 
noise samples before the stimulation. We observe that reasonably close estimates for the source extent are obtained and the standard deviation of the estimated 
source extent is 0.47cm with a mean of 1.73cm. 

Application to KC1 induced spreading depression: We applied the surface VPCM model to real data of KC1 induced spreading depression using five date 
sets taken from three rats. The MEG data were recorded with a 4 x 4 array of first order gradiometers. During spreading depression a depolarisation wave is 
traveling over the cortex with an increasing extent providing an example where surface source models are necessary. The estimated source positions at different 
time instants for one of the data sets is given in Figure 2. 



Subject A 

B 

C 

D 

E 

F 

G 

H 

Focal MSE (xl0“ 25 T 2 ) 

1.74 

3.19 

1.91 

4.41 

0.98 

1.25 

1.54 

2.26 

VPVM MSE (xlCT 25 T 2 ) 

1.24 

2.61 

1.24 

3.58 

0.70 

1.03 

1.43 

1.34 

Error Decrease 

29% 

18.3% 

35.4% 

18.8% 

28.7% 

17.1% 

6.9% 

40.5% 

Est. Source Length (cm) 

1.56 

0.94 

1.99 

1.07 

2.27 

1.32 

0.62 

2.44 


Table 1: Estimated source lengths and estimation performance results for real MEG data of eight subjects. 

CONCLUSION 

We have proposed several source models for MEG which are spatially distributed on a line or surface, and applied these models to real MEG data for N20 
response in humans and spreading depression in rats. We refer the reader to (Yetik [2004]) for a more comprehensive discussion of our line-source models and 
results. As a future work it is possible to extend our models to EEG. 
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Linearly Constrained Minimum Variance Source Imaging using Multiresolution Cortical Bases 

T. Limpiti*, B.D. Van Veen*, R.D. Nowak*, and R.T. Wakai + 

Departments of Electrical and Computer Engineering and Medical Physics 

University of Wisconsin-Madison, USA 

Recent efforts to represent non-focal source activity in MEG analysis have focused on multipole expansions or cortical patch models, e.g., [1,2]. In 
this paper we present an approach for representing both focal and non-focal source activities using multiresolution expansions defined on the cortical 
surface, and present a modification of the linearly constrained minimum variance (LCMV) method for imaging activity in a basis function 
representation. A simple set of 2-D raised cosine basis functions with different radii are constructed on the nonuniformly spaced grid of points 
representing the flattened cortical surface. The raised cosine weighting applied to each grid point is determined based on the distance to the centroid 
of the basis. The centroids are chosen to achieve approximately 50% overlap between adjacent bases. The LCMV method is used to estimate the 
expansion coefficients for this set of basis functions, which are in turn used to generate a map of source activity. We show that these basis functions 
enable the LCMV approach to identify patches of coherent cortical activity that are missed by conventional LCMV. 

[1] Jerbi, K., Mosher, J.C., Nolte, G., Baillet, S., Gamero, L. and Leahy, R.M. (2002) “From dipoles to multipoles: parametric solutions to the inverse 
problem in MEG,” Proc. Biomag2002. 

[2] Kineses, W.E., Braun, C., Kaiser, S., Grodd, W., Ackermann, H., and Mathiak, K. (2003) “Reconstruction of extended cortical sources for EEG 
and MEG based on a Monte-Carlo-Markov-Chain estimator,” Hum. Brain Mapp ., 18:100-110. 


A Semi-Parametric Method for Source Localization using Local Basis Expansions 

H. Maniar, A. Polyakov, and P. P. Mitra 
Cold Spring Harbor Laboratory, USA, Cornell Medical College, USA 

Introduction A new semi-parametric technique is proposed for localization of focal sources distributed in a smooth underlying current Unlike other 
dipole-fitting approaches, the strength of this approach lies in that it does not require the user to assume the total number of dipoles a priori, and 
dipoles in separate spatial locations are fit separately rather than simultaneously. The method is similar in spirit to Thomson's F-Test for detection of 
line components in mixed spectra. 

Methods A local region of interest (ROI) is selected, and a generalized eigenvalue problem dependent on the leadfields and the ROI is solved. 
Measurements and leadfield are re-expressed utilizing this new eigenbasis. The transformed leadfields are orthogonal and highly localized to the 
ROI. Modeling using point dipoles, the transformed parametric model is viewed as a linear regression model and a local dipole fit performed. We 
provide explicit expressions for the dipole moment estimate and an F-statistic, for significance of a non-zero dipole component within the ROI. While 
applied in the time-domain, the technique applies to the frequency-domain too. 

Results The technique is demonstrated on a MEG concentric spherical head model. Tests show that the technique exhibits a super-resolution 
phenomenon, i.e. the ability to resolve two dipoles with a separation distance smaller than the ROI. 

Discussion A statistical parametric map of the F-statistic conveniently allows the detection of dipoles as peaks of the map. While first moment 
estimation is examined, an extension for second moment (quadratic) estimation is possible. 

Acknowledgements This research was supported by the Cold Spring Harbor Laboratory (internal funds) and the Human Frontiers Science Project 
(HFSP) #RGP07/2003. 


605 



Graphical Models for Electromagnetic Source Imaging 

Srikantan S. Nagarajan, Maneesh Sahani, Kensuke Sekihara and Hagai Attias 
Biomagnetic Imaging Laboratory, University of California, San Francisco, USA 

A tomographic approach to E/MEG data inversion is to recover the time dependent moments and magnitudes of dipole sources located on a grid of 
voxels that represent the entire brain volume and accounts for measured electromagnetic signals. Due to non-uniqueness, to identify the correct 
source distribution, additional assumptions must be made about the statistical structure of the sources and the correlations among them. Existing 
source localization algorithms used in ESI generally assume implicitly (or in some cases explicitly) that the source activity at different brain locations 
are independent, or that the correlation structure between sources are known. Here, we present and demonstrate a new approach to ESI, based on 
probabilistic inference that can be represented by a graphical model. In this approach, one considers a whole class of correlation structures between 
the sources, and uses the sensor data to assign a probability score (a.lc.a. evidence) to each structure. Spatiotemporal imaging is then performed by 
inferring source activity from data, taking into account all correlation structures weighted by their score. This procedure results in a novel class of 
imaging algorithms, that make use of variational approximations to the expectation-maximization (EM) algorithm, and yields robust and accurate 
results when compared to existing methods [1]. 

[1] M. Sahani and S.S. Nagarajan. Reconstructing MEG sources with unknown correlations. Advances in Neural Information Processing Systems 16, 
2004 (Eds: S. Thrun et al). Cambridge, MA: MIT Press. 
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Detecting and localizing brain connectivity free of artifacts from volume conduction exemplified for 

MEG data of human alpha rhythm 

G. Nolte, T. Holroyd, F. Carver, R. Coppola, M. Hallett 
NIH, Bethesda, Maryland , USA 

Interpreting interactions between MEG/EEG channels or estimated source amplitudes in terms of brain connectivity can be hazardous because ‘self- 
interaction’ mediated by volume conduction or incomplete source separation can not be excluded and will in general even dominate the measures. 
Our approach is to first isolate a measure which, if non-vanishing, is inconsistent with non-interacting sources. Such a measure is given by the 
imaginary part of the cross-spectrum/coherency between EEG/MEG channels. The derivation is based on the well-confirmed validity of the quasi¬ 
static approximation and essentially exploits that a signal is not time-lagged to itself. The big question is whether this ‘imaginary coherence’ is 
observable in (literally) real data. We observed in EEG data from hand movements clear imaginary coherence between channels over the left and 
right motor areas [1] necessarily reflecting true connectivity. We here address the question how to subsequently localize the interacting generators. In 
contrast to usual methods, where models consisting of rhythmic sources are fit to the fields , we directly fit a model for the (antisymmetric) imaginary 
part of the cross-spectrum consisting of N antisymmetrically coupled dipoles to the measured imaginary cross-spectrum. The method is illustrated for 
MEG data of human alpha rhythm in eyes-closed condition. The spectrum of the imaginary cross-spectrum clearly indicates the presence of at least 4 
necessarily interacting sources. Fits of 2 to 6 dipoles in a realistic volume conductor all resulted in locations scattered in the mesial part of the 
occipital lobe with an excellent explanation of the observed imaginary cross-spectrum. 

[l]Nolte, G., Bai, O.; Wheaton, L.; Mari, Z.; Vorbach, S.; Hallet, M. Estimating true brain interaction from EEG data using the imaginary part; 
submitted to Clinical Neurophysiology. 


Influence of forward method and head model selection on localization accuracy in EEG/MEG inverse 

problem 

Maciej Orzechowski (1), Zbigniew Dunajski (1,2) 
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2) Biophysics Laboratory, Warsaw Medical Academy, Poland 

The purpose of this work is to present the influence of the BEM (boundary element) and FEM (finite element) methods as well as head model 
selection on the localization accuracy in EEG/MEG inverse problem. The comparison was obtained by means of software package '’DIPOLE” 
developed in our group. Object oriented structure of the software provides that different types of forward methods (like standard BEM, Galerkin 
BEM and FEM) and head models (spherical and realistically shaped with different conductivity parameters) can be easily used to obtain the inverse 
solutions. The comparison of localization results is also done by the program what allows user to select a best method for a given head and source 
model. In case of BEM method the head model includes information about conductivities of the tissues, in case of FEM it can handle also 
conductivity inhomogeneities and anisotropy of the tissues. The different combinations of forward methods, head models and source parameters were 
investigated in this work (spherical and realistic 3-shell head models, standard and Galerkin BEM methods, FEM method). The inverse problem was 
solved by the Marquardt nonlinear search with the initial step fitting procedure. The results show that localization accuracy significantly increases 
using Galerkin BEM method for isotropic models and FEM method for the models with inhomogeneities, especially for deep located sources. The 
work proved also that there are special cases when FEM method does not improve the solution when compared to Galerkin BEM or even standard 
BEM method. The computational cost of every method is also presented and how it is dependent on head model complexity. 
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Performance of an MEG Adaptive Beamformer Technique in the Presence of Additive Low-Rank 

Interference 

Kensuke Sekihara 1 , Srikantan S. Nagarajan 2 , David Poeppel 3 , Alec Marantz 4 
Tokyo Metropolitan Institute of Technology, Japan, University of California, San Francisco, USA, University of Maryland, USA, 

4 Massachusetts Institute of Technology, USA 

Neuromagnetic measurements are often contaminated by various types of overlapping external interference even when the measurements are 
performed in a magnetically shielded room. In this paper, we analyze the influence of external interference on the adaptive beamformer[l-2] 
reconstruction results. In our analysis, we assume that the interference has the following two properties: First, it is additive and uncorrelated with 
brain activity. Second, its temporal behavior can be characterized by a few distinct activities, and as a result, the spatio-temporal matrix of the 
interference has a few distinctly large singular values. Namely, the interference can be modeled as a low-rank signal. Under these assumptions, our 
analysis shows that the adaptive beamformer techniques are insensitive to interference when its spatial singular vectors are so different from a lead 
field vector of a brain source that the generalized cosine between these two vectors is much smaller than unity. Four types of numerical examples 
verifying this conclusion are presented. 

[1] Robinson S.E., Vrba J. 1999. Functional neuroimaging by synthetic aperture magnetometry (SAM), Recent Advances in Biomagnetism, 
Yoshimoto T. et al., Eds., Sendai, 302-305. 

[2] van Veen B.D., van Drongelen W., Yuchtman M., Suzuki A., 1997, Localization of brain electrical activity via linearly constrained minimum 
variance spatial filtering, IEEE Trans. Biomed. Eng., 44, 867-880. 


Asymptotic SNR of Scalar and Vector Minimum-Variance Beamformers for Neuromagnetic Source 

Reconstruction 

Kensuke Sekihara 1 , Srikantan S. Nagarajan 2 , David Poeppel 3 , Alec Marantz 4 
Tokyo Metropolitan Institute of Technology, Japan, University of California, San Francisco, USA, University of Maryland, USA, 

4 Massachusetts Institute of Technology, USA 

The minimum-variance (MV) beamformer, which was originally developed in the field of array signal processing and has been successfully applied 
to neuromagnetic source reconstruction problems. However, the MV beamformer in its original form cannot be directly applied to neuromagnetic 
source reconstruction, and the method should be extended to incorporate the three-dimensional vector nature of the electromagnetic sources. So far, 
two types of extensions - the scalar[l]- and the vector[2]-type extensions - have been proposed. This paper discusses the asymptotic SNR of the 
outputs of these two types of beamformers. We first show that these two types of beamformers give exactly the same output power and output SNR if 
the beamformer pointing direction is optimized. We then compare the output SNR of the beamformer with optimum direction to that of the 
conventional vector beamformer formulation where the beamformer pointing direction is not optimized. The comparison shows that the beamformer 
with optimum direction gives an output SNR superior to that of the conventional vector beamformer. Numerical examples validating the results are 
presented. 

[1] Robinson S.E., Vrba J. 1999. Functional neuroimaging by synthetic aperture magnetometry (SAM), Recent Advances in Biomagnetism, 
Yoshimoto T. et al., Eds., Sendai, 302-305. 

[2] van Veen B.D., van Drongelen W., Yuchtman M., Suzuki A., 1997, Localization of brain electrical activity via linearly constrained minimum 
variance spatial filtering, IEEE Trans. Biomed. Eng., 44, 867-880. 
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Comparison between Minimum-Variance Spatial filter and sLORETA 

1 *2 • *2 
Kensuke Sekihara , Maneesh Sahani , Srikantan S. Nagarajan 

Tokyo Metropolitan Institute of Technology, Japan, University of California, San Francisco, USA 

Minimum-variance spatial filter [1] is a representative adaptive spatial filter technique, which has been successfully applied to neuromagnetic source 
reconstruction problems. Standardized low resolution brain electromagnetic tomography (sLORETA) [2] was recently proposed, and it is one variant 
of the least-squares-based minimum-norm source reconstruction method. This paper first formulates sLORETA as a non-adaptive spatial filter, and 
then compares the performances of minimum-variance spatial filter and sLORETA on a unified basis of spatial filtering. In this comparison, 
performance measures such as the resolution kernel and the source detectability are theoretically defined and numerical examples of these 
performance measures are provided in our computer simulation. Our analysis and subsequent numerical experiments reveal that the minimum- 
variance spatial filter has the spatial resolution much higher than that from sLORETA. However, sLORETA outperforms minimum-variance method 
when highly-correlated sources exist. These methods are then applied to somatosensory and auditory evoked measurements to confirm these results. 

[1] Robinson S.E., Vrba J. 1999. Functional neuroimaging by synthetic aperture magnetometry (SAM), Recent Advances in Biomagnetism, 
Yoshimoto T. et al., Eds., Sendai, 302-305. 

[2] Pascual-Marqui R.D. 2002. Standardized low resolution brain electromagnetic tomography (sLORETA): technical details, Methods & Findings in 
Experimental Clinical Pharmacology, 24D, 5-12. 


Hierarchical Variational Bayesian Method for MEG 
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! ATR Computational Neuroscience Laboratories, 2 CREST, JST, 
technology Research Lab., Shimadzu Co., 4 National Institute for Physiological Sciences, JAPAN 

In this article, we propose a new hierarchical Bayesian method to solve the MEG inverse problem. In our method, the variance of the source current 
at each source location are considered unknown parameters and estimated from the observed MEG data by introducing a hierarchical prior on the 
variance. The fMRI information can be imposed as prior information on the variance distribution rather than the variance itself so that it gives a soft 
constraint on the variance. A spatial smoothness constraint can also be implemented as a hierarchical prior. Because of the hierarchical prior, the 
estimation problem becomes nonlinear, so the approximate posterior distribution of the source current is calculated by using the Variational Bayesian 
method. The proposed method provides a unified theory to deal with three different situations: MEG with no other data, MEG with MRI data, and 
MEG with both MRI and fMRI data. The advantages of our method over the standard linear filters are confirmed in simulation studies using a single 
source and realistic visual responses. It is also shown that accuracy of our method improves as MRI and fMRI information becomes available. 
Simulation results demonstrate that our method can appropriately estimate the source currents even if fMRI data include inaccurate information, 
while the Wiener filter method is seriously deteriorated by inaccurate fMRI information. 
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Computation of MEG in an Anisotropic Volume Conductor Using FEM Employing Whitney Elements 

I.O. Tanzer 1 , J. Nenonen , S. Jarvenpaa 2 , E. Somersalo’ 

Laboratory of Biomedical Engineering; Electromagnetics Laboratory; Institute of Mathematics, Helsinki University of Technology, 

Espoo, FINLAND, Elekta Neuromag Oy, Helsinki, FINLAND 

The human brain constitutes a complex anisotropic volume conductor. The anisotropy is especially manifested in the white matter where the ratio of 
conductivities along and perpendicular to the nerve fibers is 10:1 and also in gray matter with a ratio of 2:1 due to parallel alignment of pyramid cells 
perpendicularly to the cortex surface. In this work, the effect of a simplified cortex anisotropy is studied on the forward and inverse solutions of 

magnetoencephalography (MEG). A spherical finite-element head model is employed, and the bioelectric currents are 
modeled with Whitney basis functions. In the forward simulations, MEG signals in the Elekta Neuromag system are 


Scalp 
Skull 
CSF — 
Corte< 


ft O ± ft i> 



computed from a shallow and from a deep source location. Both focal (1 cm 2 ) and distributed (5 cm 2 ) patches of cortex 


are excited with a sinusoidal current. The solutions in the anisotropic and isotropic volume conductors are compared to 
evaluate the effect of anisotropy in the forward solutions. Both the isotropic and anisotropic forward solutions are then 
utilized to test the linear inversion [1]. White noise or measured spontaneous brain signals are superimposed on the 
forward solutions, and covariance of the noise and data are evaluated. Spatial filtering (beamforming) is applied in the inversion with analytical 
sphere formula to test if the anisotropy has an effect in the reconstructions. 

[1] Mosher, J., Baillet S, Leahy R., Equiv. of Linear Approaches in Bioelectromagnetic Inverse, Solutions IEEE Work. Stat.Sig. Proc., 2003 


Fidelity-Linearity Optimization for Bioelectromagnetic Inverse Problem 

Y. Terazono 1? 2 , A. Matani l ’ 2 

1 The University of Tokyo, Japan 2 National Institute of Information and Communications Technology, Japan 

An inverse method for solving bioelectromagnetic inverse problem is presented. The method is constructed so that the composition mapping of the 
forward and the generalized inverse mapping will approximate the identity mapping in two properties. One is basis fidelity, and the other is linearity. 
A mapping is said to have basis fidelity if a certain basis is a set of fixpoints of the mapping. These two properties are mathematically important 
because they are the necessary and sufficient conditions for identity. They are also important for practical usage. First, basis fidelity corresponds to 
the estimation performance for a set of basic patterns like magnetic fields or electric potentials generated by single current dipole. Secondly, linearity 
corresponds to how well the performance for basic patterns is kept for their linear combinations. Note that basis fidelity is extended for vector fields. 
Vector extended weighted 11-norm and 12-norm minimization with suitable weight settings are introduced to realize extended basis fidelity and 
linearity, respectively. Then, two cost functions are combined to form Fidelity-Linearity Optimization. The principal parameter of this convex 
optimization problem is the proportion of the fidelity cost to the linearity. Numerical examples showed that the proposed method provides linear to 
nonlinear estimations according to its proportion parameter. When the parameter was fixed, the estimated solutions spread reflecting the source 
distributions. Thus, the proposed method showed novel linear and nonlinear incorporated properties. However, there is still room for discussing 
optimal settings of its proportional parameter and weighting coefficients. 
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Evaluation of sLORETA for MEG Data in the Presence of Noise and Multiple Sources 

M. Wagner, M. Fuchs, J. Kastner 
Compumedics / Neuroscan, Germany 

Standardized Low Resolution Brain Electromagnetic Tomography (sLORETA) [1] can be used to compute statistical maps from MEG and EEG data 
that indicate the locations of the underlying source processes. These maps are derived by performing a location-wise inverse weighting of the results 
of a Minimum Norm Least Squares (MNLS) analysis with their estimated variances. In this contribution, we evaluate the performance of the method 
for MEG data under the presence of noise and with multiple, simultaneously active sources. An analysis of the EEG case has been presented before 
[2]. Simulated data containing one and two sources at different locations and with different strengths have been computed for a helmet MEG setup. A 
spherical volume conductor model was used. Data were disturbed using Gaussian noise of different levels. sLORETA analyses have been performed 
using the CURRY software (Compumedics / Neuroscan, El Paso, TX, USA) and their results have been compared with other dipole and current 
density results. While the method is capable of localizing single sources even in the presence of noise, multiple simultaneous sources can only be 
separated if they are not too close and their strengths are not too different. 

[1] Pascual-Marqui, A.D. 2002. Standardized low resolution brain electromagnetic tomography (sLORETA): technical details. Methods & Findings 
in Experimental & Clinical Pharmacology 24D, 5-12. 

[2] Wagner, M., Fuchs, M., Kastner, J. 2004. Evaluation of sLORETA in the Presence of Noise and Multiple Sources. Brain Topography (submitted). 


An analysis of MEG data by Hierarchical Variational Bayesian Method 
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We have proposed a variational Bayesian (VB) method to solve MEG inverse problem. Our method has shown better accuracy and spatial resolution 
than conventional linear inverse methods in computer simulation. In this study, we show an application of our method to real experimental data. In 
conventional methods, the prior current variance is directly given from other measurements, e.g., fMRI. This implies that the current source position 
is strongly limited to the active region of the fMRI measurement. On the other hand, our method introduces a soft constraint on the variance, so the 
estimation result is more sensitive to the accuracy of the forward model. It is difficult to construct accurate forward models because it requires precise 
measurement of the head position relative to the MEG sensors. We investigate how inaccurate head position deteriorates the estimation result. In this 
article, a scheme for correcting the head position is also presented. On the basis of our method, we analyze MEG data of a visual stimulus 
experiment, in which right-half, right-upper and right-lower quadrant visual stimuli were presented to the subject. We show that our method is able to 
estimate the difference of the current source position depending on the visual stimuli. 
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